Author of the publication

Multiplicative Noise and Heavy Tails in Stochastic Optimization.

ICML, volume 139 of Proceedings of Machine Learning Research, pages 4262-4274. PMLR, (2021)

Other publications of authors with the same name

Second-order optimization for non-convex machine learning: An empirical study. arXiv preprint arXiv:1708.07827, (2017). Note: Reading group.

GPU Accelerated Sub-Sampled Newton's Method for Convex Classification Problems. SDM, pages 702-710. SIAM, (2019)

Rethinking generalization requires revisiting old ideas: statistical mechanics approaches and complex learning behavior. arXiv:1710.09553, (2017). Comment: 31 pages; added brief discussion of recent papers that use/extend these ideas.

Evaluating OpenMP Tasking at Scale for the Computation of Graph Hyperbolicity. IWOMP, volume 8122 of Lecture Notes in Computer Science, pages 71-83. Springer, (2013)

Robustifying State-space Models for Long Sequences via Approximate Diagonalization. CoRR, (2023)

Randomized Dimensionality Reduction for k-Means Clustering. IEEE Trans. Inf. Theory, 61 (2): 1045-1062 (2015)

Big Little Transformer Decoder. CoRR, (2023)

Generative Modeling of Regular and Irregular Time Series Data via Koopman VAEs. CoRR, (2023)

Parallel and Communication Avoiding Least Angle Regression. CoRR, (2019)

MLPruning: A Multilevel Structured Pruning Framework for Transformer-based Models. CoRR, (2021)