Author of the publication

UniMax: Fairer and More Effective Language Sampling for Large-Scale Multilingual Pretraining.

, , , , , , and . ICLR, OpenReview.net, (2023)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

The Efficiency Misnomer., , , , and . ICLR, OpenReview.net, (2022)ExT5: Towards Extreme Multi-Task Scaling for Transfer Learning., , , , , , , , , and 4 other author(s). ICLR, OpenReview.net, (2022)Robust Representation Learning of Biomedical Names., , and . ACL (1), page 3275-3285. Association for Computational Linguistics, (2019)On Orthogonality Constraints for Transformers., , , , , , , , and . ACL/IJCNLP (2), page 375-382. Association for Computational Linguistics, (2021)How Reliable are Model Diagnostics?, , and . ACL/IJCNLP (Findings), volume ACL/IJCNLP 2021 of Findings of ACL, page 1778-1785. Association for Computational Linguistics, (2021)Holistic Multi-modal Memory Network for Movie Question Answering., , , , , and . CoRR, (2018)What it Thinks is Important is Important: Robustness Transfers through Input Gradients., , and . CoRR, (2019)Charformer: Fast Character Transformers via Gradient-based Subword Tokenization., , , , , , , , , and . CoRR, (2021)NeuPL: Attention-based Semantic Matching and Pair-Linking for Entity Disambiguation., , , , and . CIKM, page 1667-1676. ACM, (2017)Neural architectures for natural language understanding. Nanyang Technological University, Singapore, (2019)