Abstract
We address the problem of speeding up the training of convolutional neural
networks by studying a distributed method adapted to stochastic gradient
descent. Our parallel optimization setup uses several threads, each applying
individual gradient descent on a local variable. We propose a new way of
sharing information between threads, based on gossip algorithms that exhibit
good consensus convergence properties. Our method, called GoSGD, has the
advantage of being fully asynchronous and decentralized.
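To illustrate the consensus primitive the abstract refers to, the following is a minimal sketch of pairwise gossip averaging between workers. This is only an illustration of the general gossip idea, not the paper's exact GoSGD update rule; the worker values, pairing scheme, and iteration count are assumptions for the example.

```python
import random

def gossip_step(params, i, j):
    """Average the local variables of workers i and j in place.

    Repeated random pairwise exchanges like this drive all workers
    toward consensus at the global average, without any central server.
    """
    avg = [(a + b) / 2.0 for a, b in zip(params[i], params[j])]
    params[i] = list(avg)
    params[j] = list(avg)

# Each worker holds its own local copy of the model parameter
# (a single scalar here, for clarity).
params = [[0.0], [4.0], [8.0]]

random.seed(0)
for _ in range(200):
    # Pick a random pair of workers and let them exchange/average.
    i, j = random.sample(range(len(params)), 2)
    gossip_step(params, i, j)

# All local variables are now (numerically) at the global mean, 4.0.
print(params)
```

In GoSGD this kind of exchange is interleaved with each thread's local gradient steps, so the threads stay close to consensus while optimizing asynchronously.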