Author of the publication

Please choose a person to relate this publication to

To distinguish between persons with the same name, the academic degree and the title of an important publication are displayed. You can also use the button next to the name to display some publications already assigned to that person.

 

Other publications of authors with the same name

Efficient Parallelization Layouts for Large-Scale Distributed Model Training. CoRR, (2023)
Tokenizer Choice For LLM Training: Negligible or Crucial? CoRR, (2023)
MAGMA - Multimodal Augmentation of Generative Models through Adapter-based Finetuning. CoRR, (2021)
Domain-Level Explainability - A Challenge for Creating Trust in Superhuman AI Strategies. CoRR, (2020)
GPT-NeoX-20B: An Open-Source Autoregressive Language Model. CoRR, (2022)
M-VADER: A Model for Diffusion with Multimodal Context. CoRR, (2022)
MultiFusion: Fusing Pre-Trained Models for Multi-Lingual, Multi-Modal Image Generation. CoRR, (2023)
AtMan: Understanding Transformer Predictions Through Memory Efficient Attention Manipulation. CoRR, (2023)
MAGMA - Multimodal Augmentation of Generative Models through Adapter-based Finetuning. EMNLP (Findings), pages 2416-2428. Association for Computational Linguistics, (2022)