Author of the publication

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Scaling Up Models and Data with t5x and seqio., , , , , , , , , and 35 other author(s). J. Mach. Learn. Res., (2023)ShopTalk: A System for Conversational Faceted Search., , , , , , , , , and 4 other author(s). CoRR, (2021)FNet: Mixing Tokens with Fourier Transforms., , , and . NAACL-HLT, page 4296-4313. Association for Computational Linguistics, (2022)Sparse Mixers: Combining MoE and Mixing to build a more efficient BERT., and . EMNLP (Findings), page 58-75. Association for Computational Linguistics, (2022)GQA: Training Generalized Multi-Query Transformer Models from Multi-Head Checkpoints., , , , , and . EMNLP, page 4895-4901. Association for Computational Linguistics, (2023)Sparse Upcycling: Training Mixture-of-Experts from Dense Checkpoints., , , , , , , , and . ICLR, OpenReview.net, (2023)Memory Augmented Language Models through Mixture of Word Experts., , , , and . NAACL-HLT, page 4425-4438. Association for Computational Linguistics, (2024)FNet: Mixing Tokens with Fourier Transforms., , , and . CoRR, (2021)Memory Augmented Language Models through Mixture of Word Experts., , , , and . CoRR, (2023)CoLT5: Faster Long-Range Transformers with Conditional Computation., , , , , , , , , and 2 other author(s). EMNLP, page 5085-5100. Association for Computational Linguistics, (2023)