Author of the publication

Three Factors Influencing Minima in SGD

, , , , , , and . (2017)cite arxiv:1711.04623Comment: First two authors contributed equally. Short version accepted into ICLR workshop.

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

VIM: Variational Independent Modules for Video Prediction., , , , and . CLeaR, volume 177 of Proceedings of Machine Learning Research, page 70-89. PMLR, (2022)A new point process model for trajectory-based events annotation., , and . Image Processing: Machine Vision Applications, volume 8300 of SPIE Proceedings, page 83000B. SPIE, (2012)Finding Flatter Minima with SGD., , , , , , and . ICLR (Workshop), OpenReview.net, (2018)Lookahead Converges to Stationary Points of Smooth Non-convex Functions., , , and . ICASSP, page 8604-8608. IEEE, (2020)Modeling Caption Diversity in Contrastive Vision-Language Pretraining., , , , , , and . ICML, OpenReview.net, (2024)VEDIT: Latent Prediction Architecture For Procedural Video Representation Learning., , , , , , and . CoRR, (2024)Scaling Language-Free Visual Representation Learning., , , , , , , , , and 1 other author(s). CoRR, (April 2025)DNN's Sharpest Directions Along the SGD Trajectory, , , , , and . (2018)CEA LIST at TRECVID 2012 : Semantic Indexing and instance search., , , and . TRECVID, National Institute of Standards and Technology (NIST), (2012)Improved Conditional VRNNs for Video Prediction., , and . ICCV, page 7607-7616. IEEE, (2019)