Author of the publication

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Entropy-SGD optimizes the prior of a PAC-Bayes bound: Generalization properties of Entropy-SGD and data-dependent priors., and . ICML, volume 80 of Proceedings of Machine Learning Research, page 1376-1385. PMLR, (2018)On the Information Complexity of Proper Learners for VC Classes in the Realizable Case., , , and . CoRR, (2020)NUQSGD: Provably Communication-efficient Data-parallel SGD via Nonuniform Quantization., , , , , and . J. Mach. Learn. Res., (2021)The Shaped Transformer: Attention Models in the Infinite Depth-and-Width Limit., , , , , , and . CoRR, (2023)Information Complexity of Stochastic Convex Optimization: Applications to Generalization and Memorization., , , , and . CoRR, (2024)Posterior distributions are computable from predictive distributions., and . AISTATS, volume 9 of JMLR Proceedings, page 233-240. JMLR.org, (2010)The Mondrian Process, and . (2009)The Subseafloor Biosphere at Mid-Ocean Ridges, Geophysical Monograph Series, V. 144, and . chapter The Upper Temperature Limit for Life Based on Hyperthermophile Culture Experiments and Field Observations, page 13-24. AGU, (2004)The Infinite Latent Events Model., , , and . UAI, page 607-614. AUAI Press, (2009)Pruning's Effect on Generalization Through the Lens of Training and Regularization., , , , and . NeurIPS, (2022)