Author of the publication

Combining Global Sparse Gradients with Local Gradients in Distributed Neural Network Training.

, , and . EMNLP/IJCNLP (1), page 3624-3629. Association for Computational Linguistics, (2019)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Monolingual or Multilingual Instruction Tuning: Which Makes a Better Alpaca., , , , and . CoRR, (2023)Cheat Codes to Quantify Missing Source Information in Neural Machine Translation., and . NAACL-HLT, page 2472-2477. Association for Computational Linguistics, (2022)Marian: Fast Neural Machine Translation in C++., , , , , , , , , and 2 other author(s). ACL (4), page 116-121. Association for Computational Linguistics, (2018)Sparse Communication for Distributed Gradient Descent., and . EMNLP, page 440-445. Association for Computational Linguistics, (2017)Language Model Rest Costs and Space-Efficient Storage., , and . EMNLP-CoNLL, page 1169-1178. ACL, (2012)The Sockeye 2 Neural Machine Translation Toolkit at AMTA 2020., , , , , and . AMTA, page 110-115. Association for Machine Translation in the Americas, (2020)Edinburgh's Submissions to the 2020 Machine Translation Efficiency Task., , , , , , , and . NGT@ACL, page 218-224. Association for Computational Linguistics, (2020)Iterative Translation Refinement with Large Language Models., , , and . CoRR, (2023)Exploring Monolingual Data for Neural Machine Translation with Knowledge Distillation., and . CoRR, (2020)Findings of the WMT 2018 Shared Task on Parallel Corpus Filtering., , , and . WMT (shared task), page 726-739. Association for Computational Linguistics, (2018)