We analyze the variance of stochastic gradients along negative curvature
directions in certain non-convex machine learning models and show that
stochastic gradients exhibit a strong component along these directions.
Furthermore, we show that, contrary to the case of isotropic noise, this
variance is proportional to the magnitude of the corresponding eigenvalues
and does not decrease with the problem dimension. Based on this observation,
we propose a new assumption under which we show that the injection of
explicit, isotropic noise, usually applied to make gradient descent escape
saddle points, can successfully be replaced by a simple SGD step.
Additionally, under the same condition, we derive the first convergence rate
for plain SGD to a second-order stationary point in a number of iterations
that is independent of the problem dimension.
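As a rough illustration of the mechanism the abstract describes, the sketch below contrasts the classic escape strategy (injecting explicit isotropic noise whenever the full gradient is small) with a plain SGD step on a toy quadratic saddle whose stochastic-gradient noise is aligned with the negative-curvature direction. The objective, step size, thresholds, and all names are illustrative assumptions; this is not the paper's actual algorithm or analysis.

import numpy as np

rng = np.random.default_rng(0)

# Toy objective with a strict saddle at w = 0: f(w) = 0.5 * w^T H w,
# with one positive and one negative Hessian eigenvalue.
H = np.diag([1.0, -1.0])
e2 = np.array([0.0, 1.0])   # eigenvector of the negative eigenvalue

def stochastic_gradient(w):
    # Per-sample gradient H w + b with zero-mean noise b = z * e2 that is
    # aligned with the negative-curvature direction (the kind of property
    # the paper argues stochastic gradients of real ML models have).
    return H @ w + rng.standard_normal() * e2

def perturbed_gd_step(w, lr=0.1, grad_tol=1e-3, radius=1e-2):
    # Classic recipe: take the exact gradient and inject explicit isotropic
    # noise whenever the gradient is too small to make progress.
    g = H @ w
    if np.linalg.norm(g) < grad_tol:
        g = g + radius * rng.standard_normal(2)
    return w - lr * g

def sgd_step(w, lr=0.1):
    # The paper's observation: a plain SGD step already carries anisotropic
    # noise along negative curvature, so no extra perturbation is needed.
    return w - lr * stochastic_gradient(w)

w_sgd = np.zeros(2)   # start both iterates exactly at the saddle point
w_pgd = np.zeros(2)
for _ in range(50):
    w_sgd = sgd_step(w_sgd)
    w_pgd = perturbed_gd_step(w_pgd)
print(np.linalg.norm(w_sgd), np.linalg.norm(w_pgd))   # both move away from 0

On this quadratic toy both iterates leave the saddle; the abstract's point is that, under the proposed assumption, the noise already present in SGD suffices, and the resulting iteration count does not depend on the problem dimension.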
@misc{daneshmand2018escaping,
author = {Daneshmand, Hadi and Kohler, Jonas and Lucchi, Aurelien and Hofmann, Thomas},
keywords = {SGD optimization theory},
note = {cite arxiv:1803.05999},
title = {Escaping Saddles with Stochastic Gradients},
url = {http://arxiv.org/abs/1803.05999},
year = 2018
}