Author of the publication

Please choose a person to relate this publication to

To distinguish between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some of the publications already assigned to that person.

 

Other publications of authors with the same name

Zorro: the masked multimodal transformer. CoRR, (2023)

End-to-End Learning of Visual Representations from Uncurated Instructional Videos. CoRR, (2019)

End-to-End Learning of Visual Representations From Uncurated Instructional Videos. CVPR, page 9876-9886. Computer Vision Foundation / IEEE, (2020)

Human-Agent Cooperation in Bridge Bidding. CoRR, (2020)

TAP-Vid: A Benchmark for Tracking Any Point in a Video. NeurIPS, (2022)

A Short Note on the Kinetics-700-2020 Human Action Dataset. CoRR, (2020)

Towards Learning Universal Audio Representations. CoRR, (2021)

Perception Test: A Diagnostic Benchmark for Multimodal Video Models. CoRR, (2023)

Towards Learning Universal Audio Representations. ICASSP, page 4593-4597. IEEE, (2022)

Visual Grounding in Video for Unsupervised Word Translation. CVPR, page 10847-10856. Computer Vision Foundation / IEEE, (2020)