From post

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed.

 

Другие публикации лиц с тем же именем

Multimodal Self-Supervised Learning of General Audio Representations., , , , и . CoRR, (2021)Three ways to improve feature alignment for open vocabulary detection., , , , , и . CoRR, (2023)Controllable Attention for Structured Layered Video Decomposition., , , и . ICCV, стр. 5733-5742. IEEE, (2019)Zorro: the masked multimodal transformer., , , , , , , , , и 1 other автор(ы). CoRR, (2023)Decoupling the Role of Data, Attention, and Losses in Multimodal Transformers., , , , и . Trans. Assoc. Comput. Linguistics, (2021)End-to-End Learning of Visual Representations from Uncurated Instructional Videos., , , , , и . CoRR, (2019)Gemini: A Family of Highly Capable Multimodal Models., , , , , , , , , и 42 other автор(ы). CoRR, (2023)End-to-End Learning of Visual Representations From Uncurated Instructional Videos., , , , , и . CVPR, стр. 9876-9886. Computer Vision Foundation / IEEE, (2020)Thinking Fast and Slow: Efficient Text-to-Visual Retrieval With Transformers., , , , и . CVPR, стр. 9826-9836. Computer Vision Foundation / IEEE, (2021)Perceiver IO: A General Architecture for Structured Inputs & Outputs., , , , , , , , , и 5 other автор(ы). CoRR, (2021)