Inproceedings

Deep Learning is not a Matter of Depth but of Good Training

Björn Barz and Joachim Denzler.
International Conference on Pattern Recognition and Artificial Intelligence (ICPRAI), pages 683–687. CENPARMI, Concordia University, (2018)

Abstract

In the past few years, deep neural networks have often been claimed to provide greater representational power than shallow networks. In this work, we propose a wide, shallow, and strictly sequential network architecture without any residual connections. When trained with cyclical learning rate schedules, this simple network achieves a classification accuracy on CIFAR-100 competitive with that of a 10 times deeper residual network, while it can be trained 4 times faster. This provides evidence that neither depth nor residual connections are crucial for deep learning. Instead, residual connections seem merely to facilitate training with plain SGD by helping it avoid bad local minima. We believe that our work can hence point the research community to the actual bottleneck of contemporary deep learning: the optimization algorithms.
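The exact Plain11 architecture and training hyperparameters are given in the paper itself; as a rough illustration of the idea in the abstract, the PyTorch sketch below builds a wide, strictly sequential CNN without any skip connections and pairs it with a triangular cyclical learning rate. All layer counts, channel widths, and hyperparameter values here are illustrative assumptions, not the authors' configuration.

```python
# Hypothetical sketch of a wide, shallow, strictly sequential CNN
# trained with a cyclical learning rate, in the spirit of the paper.
# Widths, depth, and hyperparameters are assumptions for illustration.
import torch
import torch.nn as nn

def conv_block(c_in, c_out):
    # Plain conv -> batch norm -> ReLU; note: no residual connection.
    return nn.Sequential(
        nn.Conv2d(c_in, c_out, kernel_size=3, padding=1, bias=False),
        nn.BatchNorm2d(c_out),
        nn.ReLU(inplace=True),
    )

plain_net = nn.Sequential(           # strictly sequential: no skip paths
    conv_block(3, 256), conv_block(256, 256), conv_block(256, 256),
    nn.MaxPool2d(2),                 # 32x32 -> 16x16
    conv_block(256, 512), conv_block(512, 512), conv_block(512, 512),
    nn.MaxPool2d(2),                 # 16x16 -> 8x8
    conv_block(512, 512), conv_block(512, 512),
    nn.AdaptiveAvgPool2d(1), nn.Flatten(),
    nn.Linear(512, 100),             # CIFAR-100 has 100 classes
)

optimizer = torch.optim.SGD(plain_net.parameters(), lr=0.01,
                            momentum=0.9, weight_decay=5e-4)

# Triangular cyclical learning rate: the LR oscillates between base_lr
# and max_lr, which is the kind of schedule the abstract credits with
# letting plain SGD avoid bad local minima without residual connections.
scheduler = torch.optim.lr_scheduler.CyclicLR(
    optimizer, base_lr=1e-4, max_lr=0.1, step_size_up=2000)

# Inside the training loop, step the scheduler once per batch:
#   loss.backward(); optimizer.step(); scheduler.step()
```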

Users

  • @bjoern.barz
  • @s364315

Comments and Reviews

  • @s364315 (3 years ago): Reference for the Plain11 neural network architecture