Author of the publication

Only 5% Attention Is All You Need: Efficient Long-range Document-level Neural Machine Translation.

IJCNLP (1), page 733-743. Association for Computational Linguistics, (2023)


Other publications of authors with the same name

A caching-list based fast handoff mechanism in wireless mesh networks. ICTC, page 402-407. IEEE, (2013)

Better Datastore, Better Translation: Generating Datastores from Pre-Trained Models for Nearest Neural Machine Translation. CoRR, (2022)

NeurST: Neural Speech Translation Toolkit. CoRR, (2020)

LightSeq: A High Performance Inference Library for Transformers. NAACL-HLT (Industry Papers), page 113-120. Association for Computational Linguistics, (2021)

Cross-modal Contrastive Learning for Speech Translation. NAACL-HLT, page 5099-5113. Association for Computational Linguistics, (2022)

GigaST: A 10,000-hour Pseudo Speech Translation Corpus. CoRR, (2022)

Unified Multimodal Punctuation Restoration Framework for Mixed-Modality Corpus. ICASSP, page 7272-7276. IEEE, (2022)

genCNN: A Convolutional Architecture for Word Sequence Prediction. ACL (1), page 1567-1576. The Association for Computer Linguistics, (2015)

BigVideo: A Large-scale Video Subtitle Translation Dataset for Multimodal Machine Translation. ACL (Findings), page 8456-8473. Association for Computational Linguistics, (2023)

Beyond Triplet: Leveraging the Most Data for Multimodal Machine Translation. ACL (Findings), page 2679-2697. Association for Computational Linguistics, (2023)