Author of the publication

Taking a HINT: Leveraging Explanations to Make Vision and Language Models More Grounded.

, , , , , , , and . ICCV, page 2591-2600. IEEE, (2019)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Visual Conceptual Blending with Large-Scale Language and Vision Models., and . ICCC, page 6-10. Association for Computational Creativity (ACC), (2021)Feel The Music: Automatically Generating A Dance For An Input Song., , , and . ICCC, page 292-295. Association for Computational Creativity (ACC), (2020)Sim-to-Real Transfer for Vision-and-Language Navigation., , , , , , and . CoRL, volume 155 of Proceedings of Machine Learning Research, page 671-681. PMLR, (2020)12-in-1: Multi-Task Vision and Language Representation Learning., , , , and . CVPR, page 10434-10443. Computer Vision Foundation / IEEE, (2020)Vx2Text: End-to-End Learning of Video-Based Text Generation From Multimodal Inputs., , , , , and . CVPR, page 7005-7015. Computer Vision Foundation / IEEE, (2021)KRISP: Integrating Implicit and Symbolic Knowledge for Open-Domain Knowledge-Based VQA., , , , and . CoRR, (2020)Text-Conditional Contextualized Avatars For Zero-Shot Personalization., , , , , and . CoRR, (2023)We Are Humor Beings: Understanding and Predicting Visual Humor., , , , , , and . CoRR, (2015)Visual Word2Vec (vis-w2v): Learning Visually Grounded Word Embeddings Using Abstract Scenes., , , and . CoRR, (2015)CoDraw: Visual Dialog for Collaborative Drawing., , , , and . CoRR, (2017)