Author of the publication

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Layerwise Bregman Representation Learning of Neural Networks with Applications to Knowledge Distillation., , , and . Trans. Mach. Learn. Res., (2023)Sketchy: Memory-efficient Adaptive Regularization with Frequent Directions., , , , and . CoRR, (2023)A Computationally Efficient Sparsified Online Newton Method., , , , , and . CoRR, (2023)Benchmarking Neural Network Training Algorithms., , , , , , , , , and 15 other author(s). CoRR, (2023)TF-Ranking: Scalable TensorFlow Library for Learning-to-Rank., , , , , , , , , and . CoRR, (2018)Stochastic Optimization with Laggard Data Pipelines., , , , and . NeurIPS, (2020)Large-Scale Differentially Private BERT., , , , and . EMNLP (Findings), page 6481-6491. Association for Computational Linguistics, (2022)Knowledge distillation: A good teacher is patient and consistent., , , , , and . CVPR, page 10915-10924. IEEE, (2022)Measuring and Harnessing Transference in Multi-Task Learning., , , , , and . CoRR, (2020)Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling., , , , , , , , , and 81 other author(s). CoRR, (2019)