Author of the publication

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Multimodal Self-Supervised Learning of General Audio Representations., , , , and . CoRR, (2021)Controllable Attention for Structured Layered Video Decomposition., , , and . ICCV, page 5733-5742. IEEE, (2019)Three ways to improve feature alignment for open vocabulary detection., , , , , and . CoRR, (2023)Zorro: the masked multimodal transformer., , , , , , , , , and 1 other author(s). CoRR, (2023)Decoupling the Role of Data, Attention, and Losses in Multimodal Transformers., , , , and . CoRR, (2021)End-to-End Learning of Visual Representations from Uncurated Instructional Videos., , , , , and . CoRR, (2019)Perceiver IO: A General Architecture for Structured Inputs & Outputs., , , , , , , , , and 5 other author(s). CoRR, (2021)Gemini: A Family of Highly Capable Multimodal Models., , , , , , , , , and 42 other author(s). CoRR, (2023)End-to-End Learning of Visual Representations From Uncurated Instructional Videos., , , , , and . CVPR, page 9876-9886. Computer Vision Foundation / IEEE, (2020)Thinking Fast and Slow: Efficient Text-to-Visual Retrieval With Transformers., , , , and . CVPR, page 9826-9836. Computer Vision Foundation / IEEE, (2021)