Author of the publication

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Combining Global Sparse Gradients with Local Gradients in Distributed Neural Network Training.

A. Aji, K. Heafield, and N. Bogoychev. EMNLP/IJCNLP (1), page 3624-3629. Association for Computational Linguistics, (2019)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

Kenneth Froese

Kenneth Anders

Kenneth Simon

Kenneth Ndyabawe

Kenneth Vanhoey

Other publications of authors with the same name

Monolingual or Multilingual Instruction Tuning: Which Makes a Better Alpaca.P. Chen, S. Ji, N. Bogoychev, B. Haddow, and K. Heafield. CoRR, (2023)Cheat Codes to Quantify Missing Source Information in Neural Machine Translation.P. Pal, and K. Heafield. NAACL-HLT, page 2472-2477. Association for Computational Linguistics, (2022)Marian: Fast Neural Machine Translation in C++.M. Junczys-Dowmunt, R. Grundkiewicz, T. Dwojak, H. Hoang, K. Heafield, T. Neckermann, F. Seide, U. Germann, A. Aji, N. Bogoychev and 2 other author(s). ACL (4), page 116-121. Association for Computational Linguistics, (2018)Sparse Communication for Distributed Gradient Descent.A. Aji, and K. Heafield. EMNLP, page 440-445. Association for Computational Linguistics, (2017)Language Model Rest Costs and Space-Efficient Storage.K. Heafield, P. Koehn, and A. Lavie. EMNLP-CoNLL, page 1169-1178. ACL, (2012)The Sockeye 2 Neural Machine Translation Toolkit at AMTA 2020.T. Domhan, M. Denkowski, D. Vilar, X. Niu, F. Hieber, and K. Heafield. AMTA, page 110-115. Association for Machine Translation in the Americas, (2020)Edinburgh's Submissions to the 2020 Machine Translation Efficiency Task.N. Bogoychev, R. Grundkiewicz, A. Aji, M. Behnke, K. Heafield, S. Kashyap, E. Farsarakis, and M. Chudyk. NGT@ACL, page 218-224. Association for Computational Linguistics, (2020)Iterative Translation Refinement with Large Language Models.P. Chen, Z. Guo, B. Haddow, and K. Heafield. CoRR, (2023)Exploring Monolingual Data for Neural Machine Translation with Knowledge Distillation.A. Aji, and K. Heafield. CoRR, (2020)Findings of the WMT 2018 Shared Task on Parallel Corpus Filtering.P. Koehn, H. Khayrallah, K. Heafield, and M. Forcada. WMT (shared task), page 726-739. Association for Computational Linguistics, (2018)

BibSonomy

Disambiguation of "Heafield, Kenneth"

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Combining Global Sparse Gradients with Local Gradients in Distributed Neural Network Training.

Please choose a person to relate this publication to

Kenneth Froese

Kenneth Anders

Kenneth Simon

Kenneth Ndyabawe

Kenneth Vanhoey

Other publications of authors with the same name

Disambiguation

BibSonomy

Disambiguation of "Heafield, Kenneth"

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML Combining Global Sparse Gradients with Local Gradients in Distributed Neural Network Training.

Please choose a person to relate this publication to

Kenneth Froese

Kenneth Anders

Kenneth Simon

Kenneth Ndyabawe

Kenneth Vanhoey

Other publications of authors with the same name

Disambiguation

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Combining Global Sparse Gradients with Local Gradients in Distributed Neural Network Training.