Author of the publication

Do All Languages Cost the Same? Tokenization in the Era of Commercial Language Models.

EMNLP, page 9904-9923. Association for Computational Linguistics, (2023)

Other publications of authors with the same name

Parser combinators for Tigrinya and Oromo morphology. LREC, European Language Resources Association (ELRA), (2018)
Low-Resource Machine Translation using Interlinear Glosses. CoRR, (2019)
Mitigating the Linguistic Gap with Phonemic Representations for Robust Multilingual Language Understanding. CoRR, (2024)
Data-adaptive Transfer Learning for Translation: A Case Study in Haitian and Jamaican. LoResMT@COLING, page 35-42. Association for Computational Linguistics, (2022)
Wav2Gloss: Generating Interlinear Glossed Text from Speech. CoRR, (2024)
ChatGPT MT: Competitive for High- (but Not Low-) Resource Languages. WMT, page 392-418. Association for Computational Linguistics, (2023)
Bridge-Language Capitalization Inference in Western Iranian: Sorani, Kurmanji, Zazaki, and Tajik. LREC, European Language Resources Association (ELRA), (2016)
CMU-01 at the SIGMORPHON 2019 Shared Task on Crosslinguality and Context in Morphology. CoRR, (2019)
Quantifying Cognitive Factors in Lexical Decline. CoRR, (2021)
The ARIEL-CMU Systems for LoReHLT18. CoRR, (2019)