Author of the publication

Scaling Laws vs Model Architectures: How does Inductive Bias Influence Scaling?

EMNLP (Findings), page 12342-12364. Association for Computational Linguistics, (2023)

Other publications of authors with the same name

Universal Transformers. ICLR (Poster), OpenReview.net, (2019)

MetNet: A Neural Weather Model for Precipitation Forecasting. CoRR, (2020)

Molecular simulation and Monte Carlo study of structural-transport-properties of PEBA-MFI zeolite mixed matrix membranes for CO2, CH4 and N2 separation. Comput. Chem. Eng., (2017)

SIGIR 2018 Workshop on Learning from Limited or Noisy Data for Information Retrieval. SIGIR, page 1439-1440. ACM, (2018)

Gradual Domain Adaptation in the Wild: When Intermediate Distributions are Absent. CoRR, (2021)

Learning to rank for multi-label text classification: Combining different sources of information. Nat. Lang. Eng., 27 (1): 89-111 (2021)

How (not) to ensemble LVLMs for VQA. CoRR, (2023)

Scaling Laws vs Model Architectures: How does Inductive Bias Influence Scaling? CoRR, (2022)

VUT: Versatile UI Transformer for Multi-Modal Multi-Task User Interface Modeling. CoRR, (2021)

The Efficiency Misnomer. CoRR, (2021)