Author of the publication

Image Retrieval on Real-life Images with Pre-trained Vision-and-Language Models.

, , , and . ICCV, page 2105-2114. IEEE, (2021)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

DORi: Discovering Object Relationship for Moment Localization of a Natural-Language Query in Video., , , , and . CoRR, (2020)Word-level Deep Sign Language Recognition from Video: A New Large-scale Dataset and Methods Comparison., , , and . CoRR, (2019)Word-level Deep Sign Language Recognition from Video: A New Large-scale Dataset and Methods Comparison., , , and . WACV, page 1448-1458. IEEE, (2020)VLN BERT: A Recurrent Vision-and-Language BERT for Navigation., , , , and . CVPR, page 1643-1653. Computer Vision Foundation / IEEE, (2021)LocFormer: Enabling Transformers to Perform Temporal Moment Localization on Long Untrimmed Videos With a Feature Sampling Approach., , , , and . CoRR, (2021)Image Retrieval on Real-life Images with Pre-trained Vision-and-Language Models., , , and . ICCV, page 2105-2114. IEEE, (2021)Memory-efficient Temporal Moment Localization in Long Videos., , , , and . EACL, page 1901-1916. Association for Computational Linguistics, (2023)Divide and Conquer: Efficient Density-Based Tracking of 3D Sensors in Manhattan Worlds., , , and . ACCV (5), volume 10115 of Lecture Notes in Computer Science, page 3-19. Springer, (2016)Action Anticipation by Predicting Future Dynamic Images., , and . ECCV Workshops (3), volume 11131 of Lecture Notes in Computer Science, page 89-105. Springer, (2018)Proposal-free Temporal Moment Localization of a Natural-Language Query in Video using Guided Attention., , , , and . WACV, page 2453-2462. IEEE, (2020)