Author of the publication

The Effect of Network Width on Stochastic Gradient Descent and Generalization: an Empirical Study.

, , , and . ICML, volume 97 of Proceedings of Machine Learning Research, page 5042-5051. PMLR, (2019)

Other publications of authors with the same name

Efficient Knowledge Distillation for RNN-Transducer Models., , , , , and . CoRR, (2020)

SpeechStew: Simply Mix All Available Speech Recognition Data to Train One Large Neural Network., , , , , and . CoRR, (2021)

Noise2Music: Text-conditioned Music Generation with Diffusion Models., , , , , , , , , and 4 other author(s). CoRR, (2023)

The Effect of Network Width on Stochastic Gradient Descent and Generalization: an Empirical Study., , , and . ICML, volume 97 of Proceedings of Machine Learning Research, page 5042-5051. PMLR, (2019)

Universal Paralinguistic Speech Representations Using self-Supervised Conformers., , , , and . ICASSP, page 3169-3173. IEEE, (2022)

G-Augment: Searching for the Meta-Structure of Data Augmentation Policies for ASR., , , , , , , , and . SLT, page 23-30. IEEE, (2022)

SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition., , , , , , and . INTERSPEECH, page 2613-2617. ISCA, (2019)

Improved Noisy Student Training for Automatic Speech Recognition., , , , , , , and . INTERSPEECH, page 2817-2821. ISCA, (2020)

Open Science principles for accelerating trait-based science across the Tree of Life, , , , , , , , , and 47 other author(s). Nature Ecology & Evolution, 4 (3): 294–303 (February 2020)

Universal Paralinguistic Speech Representations Using Self-Supervised Conformers., , , , and . CoRR, (2021)