Author of the publication

Sparse Communication for Distributed Gradient Descent.

EMNLP, pages 440-445. Association for Computational Linguistics, (2017)

Other publications of authors with the same name

Monolingual or Multilingual Instruction Tuning: Which Makes a Better Alpaca. CoRR, (2023)
Cheat Codes to Quantify Missing Source Information in Neural Machine Translation. NAACL-HLT, pages 2472-2477. Association for Computational Linguistics, (2022)
Edinburgh's Submissions to the 2020 Machine Translation Efficiency Task. NGT@ACL, pages 218-224. Association for Computational Linguistics, (2020)
The Sockeye 2 Neural Machine Translation Toolkit at AMTA 2020. AMTA, pages 110-115. Association for Machine Translation in the Americas, (2020)
Marian: Fast Neural Machine Translation in C++. ACL (4), pages 116-121. Association for Computational Linguistics, (2018)
Sparse Communication for Distributed Gradient Descent. EMNLP, pages 440-445. Association for Computational Linguistics, (2017)
Language Model Rest Costs and Space-Efficient Storage. EMNLP-CoNLL, pages 1169-1178. ACL, (2012)
Iterative Translation Refinement with Large Language Models. CoRR, (2023)
Exploring Monolingual Data for Neural Machine Translation with Knowledge Distillation. CoRR, (2020)
Making Asynchronous Stochastic Gradient Descent Work for Transformers. NGT@EMNLP-IJCNLP, pages 80-89. Association for Computational Linguistics, (2019)