Author of the publication

An Expert is Worth One Token: Synergizing Multiple Expert LLMs as Generalist via Expert Token Routing.

, , , , , , , , , , and . ACL (1), page 11385-11396. Association for Computational Linguistics, (2024)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Artifact Disentanglement Network for Unsupervised Metal Artifact Reduction., , , , and . MICCAI (6), volume 11769 of Lecture Notes in Computer Science, page 203-211. Springer, (2019)Solving cold-start problem in large-scale recommendation engines: A deep learning approach., , , , , and . IEEE BigData, page 1901-1910. IEEE Computer Society, (2016)The effect of pets on happiness: A data-driven approach via large-scale social media., , , and . IEEE BigData, page 1889-1894. IEEE Computer Society, (2016)Self-Infilling Code Generation., , , , and . CoRR, (2023)LEMON: Lossless model expansion., , , , , , , , and . CoRR, (2023)A Reparameterized Discrete Diffusion Model for Text Generation., , , and . CoRR, (2023)Let's reward step by step: Step-Level reward model as the Navigators for Reasoning., , , , , , and . CoRR, (2023)Sentribute: image sentiment analysis from a mid-level perspective., , , and . WISDOM, page 10:1-10:8. ACM, (2013)HiCLIP: Contrastive Language-Image Pretraining with Hierarchy-aware Attention., , , , and . ICLR, OpenReview.net, (2023)LoBaSS: Gauging Learnability in Supervised Fine-tuning Data., , , , , , and . CoRR, (2023)