Author of the publication

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Scaling Local Self-Attention for Parameter Efficient Visual Backbones., , , , , and . CVPR, page 12894-12904. Computer Vision Foundation / IEEE, (2021)Stay on the Path: Instruction Fidelity in Vision-and-Language Navigation., , , , , and . ACL (1), page 1862-1872. Association for Computational Linguistics, (2019)Rule Markov Models for Fast Tree-to-String Translation., , , and . ACL, page 856-864. The Association for Computer Linguistics, (2011)Smaller Alignment Models for Better Translations: Unsupervised Word Alignment with the l0-norm., , and . ACL (1), page 311-319. The Association for Computer Linguistics, (2012)Scale Efficiently: Insights from Pretraining and Finetuning Transformers., , , , , , , , , and . ICLR, OpenReview.net, (2022)Decoding with Large-Scale Neural Language Models Improves Translation., , , and . EMNLP, page 1387-1392. ACL, (2013)Rethinking Reflection in Pre-Training., , , , , , , , , and 17 other author(s). CoRR, (April 2025)Efficient Content-Based Sparse Attention with Routing Transformers., , , and . CoRR, (2020)Music Transformer, , , , , , , , , and . (2018)cite arxiv:1809.04281Comment: Improved skewing section and accompanying figures. Previous titles are Än Improved Relative Self-Attention Mechanism for Transformer with Application to Music Generation" and "Music Transformer".Radiobot-CFF: a spoken dialogue system for military training., , , , , , and . INTERSPEECH, ISCA, (2006)