Author of the publication

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation, , , , , , , , , and 20 other author(s). arXiv.org, (September 2016)Massively Multilingual Neural Machine Translation in the Wild: Findings and Challenges., , , , , , , , , and 3 other author(s). CoRR, (2019)Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling., , , , , , , , , and 81 other author(s). CoRR, (2019)Beyond Distillation: Task-level Mixture-of-Experts for Efficient Inference., , , , , , and . EMNLP (Findings), page 3577-3599. Association for Computational Linguistics, (2021)Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation, , , , , , , , , and 21 other author(s). (2016)cite arxiv:1609.08144.LaMDA: Language Models for Dialog Applications., , , , , , , , , and 47 other author(s). CoRR, (2022)Subtitle Translation as Markup Translation., , , and . Interspeech, page 2237-2241. ISCA, (2021)GLaM: Efficient Scaling of Language Models with Mixture-of-Experts., , , , , , , , , and 17 other author(s). ICML, volume 162 of Proceedings of Machine Learning Research, page 5547-5569. PMLR, (2022)Explicit Enumeration of Triangulations with Multiple Boundaries.. Electron. J. Comb., (2007)Data Scaling Laws in NMT: The Effect of Noise and Architecture., , , , , , , and . CoRR, (2022)