Author of the publication

Efficient Multilingual Multi-modal Pre-training through Triple Contrastive Loss.

, , , , and . COLING, page 5730-5744. International Committee on Computational Linguistics, (2022)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Towards a Complete Benchmark on Video Moment Localization., , , , , , , , , and . AISTATS, volume 238 of Proceedings of Machine Learning Research, page 4168-4176. PMLR, (2024)Sparse DETR: Efficient End-to-End Object Detection with Learnable Sparsity., , , and . ICLR, OpenReview.net, (2022)CXR-CLIP: Toward Large Scale Chest X-ray Language-Image Pre-training., , , , , , , and . MICCAI (2), volume 14221 of Lecture Notes in Computer Science, page 101-111. Springer, (2023)Accelerating Object Detection by Erasing Background Activations., , , and . CoRR, (2020)Noise-aware Learning from Web-crawled Image-Text Data for Image Captioning., , , and . ICCV, page 2930-2940. IEEE, (2023)MELTR: Meta Loss Transformer for Learning to Fine-tune Video Foundation Models., , , , , and . CVPR, page 20105-20115. IEEE, (2023)Large Language Models are Temporal and Causal Reasoners for Video Question Answering., , , , and . EMNLP, page 4300-4316. Association for Computational Linguistics, (2023)Learning to Generate Text-Grounded Mask for Open-World Semantic Segmentation from Only Image-Text Pairs., , and . CVPR, page 11165-11174. IEEE, (2023)Spatially Consistent Representation Learning., , , and . CVPR, page 1144-1153. Computer Vision Foundation / IEEE, (2021)Efficient Multilingual Multi-modal Pre-training through Triple Contrastive Loss., , , , and . COLING, page 5730-5744. International Committee on Computational Linguistics, (2022)