The demand for artificial intelligence has grown significantly over the last
decade and this growth has been fueled by advances in machine learning
techniques and the ability to leverage hardware acceleration. However, in order
to increase the quality of predictions and render machine learning solutions
feasible for more complex applications, a substantial amount of training data
is required. Although small machine learning models can be trained with modest
amounts of data, the input for training larger models such as neural networks
grows exponentially with the number of parameters. Since the demand for
processing training data has outpaced the increase in computation power of
computing machinery, there is a need for distributing the machine learning
workload across multiple machines, and turning the centralized into a
distributed system. These distributed systems present new challenges, first and
foremost the efficient parallelization of the training process and the creation
of a coherent model. This article provides an extensive overview of the current
state-of-the-art in the field by outlining the challenges and opportunities of
distributed machine learning over conventional (centralized) machine learning,
discussing the techniques used for distributed machine learning, and providing
an overview of the systems that are available.
Описание
[1912.09789] A Survey on Distributed Machine Learning
%0 Generic
%1 verbraeken2019survey
%A Verbraeken, Joost
%A Wolting, Matthijs
%A Katzy, Jonathan
%A Kloppenburg, Jeroen
%A Verbelen, Tim
%A Rellermeyer, Jan S.
%D 2019
%K 2019 arxiv distributed-systems machine-learning survey
%T A Survey on Distributed Machine Learning
%U http://arxiv.org/abs/1912.09789
%X The demand for artificial intelligence has grown significantly over the last
decade and this growth has been fueled by advances in machine learning
techniques and the ability to leverage hardware acceleration. However, in order
to increase the quality of predictions and render machine learning solutions
feasible for more complex applications, a substantial amount of training data
is required. Although small machine learning models can be trained with modest
amounts of data, the input for training larger models such as neural networks
grows exponentially with the number of parameters. Since the demand for
processing training data has outpaced the increase in computation power of
computing machinery, there is a need for distributing the machine learning
workload across multiple machines, and turning the centralized into a
distributed system. These distributed systems present new challenges, first and
foremost the efficient parallelization of the training process and the creation
of a coherent model. This article provides an extensive overview of the current
state-of-the-art in the field by outlining the challenges and opportunities of
distributed machine learning over conventional (centralized) machine learning,
discussing the techniques used for distributed machine learning, and providing
an overview of the systems that are available.
@misc{verbraeken2019survey,
abstract = {The demand for artificial intelligence has grown significantly over the last
decade and this growth has been fueled by advances in machine learning
techniques and the ability to leverage hardware acceleration. However, in order
to increase the quality of predictions and render machine learning solutions
feasible for more complex applications, a substantial amount of training data
is required. Although small machine learning models can be trained with modest
amounts of data, the input for training larger models such as neural networks
grows exponentially with the number of parameters. Since the demand for
processing training data has outpaced the increase in computation power of
computing machinery, there is a need for distributing the machine learning
workload across multiple machines, and turning the centralized into a
distributed system. These distributed systems present new challenges, first and
foremost the efficient parallelization of the training process and the creation
of a coherent model. This article provides an extensive overview of the current
state-of-the-art in the field by outlining the challenges and opportunities of
distributed machine learning over conventional (centralized) machine learning,
discussing the techniques used for distributed machine learning, and providing
an overview of the systems that are available.},
added-at = {2019-12-24T06:06:41.000+0100},
author = {Verbraeken, Joost and Wolting, Matthijs and Katzy, Jonathan and Kloppenburg, Jeroen and Verbelen, Tim and Rellermeyer, Jan S.},
biburl = {https://www.bibsonomy.org/bibtex/24b70cdadea3badde99e49701b8fe2674/analyst},
description = {[1912.09789] A Survey on Distributed Machine Learning},
interhash = {b6bbbb3a1a96db22f4d24bda5bf1820e},
intrahash = {4b70cdadea3badde99e49701b8fe2674},
keywords = {2019 arxiv distributed-systems machine-learning survey},
note = {cite arxiv:1912.09789},
timestamp = {2019-12-24T06:06:41.000+0100},
title = {A Survey on Distributed Machine Learning},
url = {http://arxiv.org/abs/1912.09789},
year = 2019
}