Author of the publication

Multimodal Explanations: Justifying Decisions and Pointing to the Evidence.

, , , , , , and . CVPR, page 8779-8788. IEEE Computer Society, (2018)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Translating Videos to Natural Language Using Deep Recurrent Neural Networks., , , , , and . CoRR, (2014)FLAVA: A Foundational Language And Vision Alignment Model., , , , , , and . CoRR, (2021)Long-term Recurrent Convolutional Networks for Visual Recognition and Description., , , , , , and . CoRR, (2014)Memory Aware Synapses: Learning what (not) to forget., , , , and . CoRR, (2017)Efficient Lifelong Learning with A-GEM., , , and . ICLR (Poster), OpenReview.net, (2019)Improving Selective Visual Question Answering by Learning from Your Peers., , , , , , , and . CVPR, page 24049-24059. IEEE, (2023)High-Level Fusion of Depth and Intensity for Pedestrian Classification., , and . DAGM-Symposium, volume 5748 of Lecture Notes in Computer Science, page 101-110. Springer, (2009)Reliable Visual Question Answering: Abstain Rather Than Answer Incorrectly., , , , , , and . ECCV (36), volume 13696 of Lecture Notes in Computer Science, page 148-166. Springer, (2022)FLAVA: A Foundational Language And Vision Alignment Model., , , , , , and . CVPR, page 15617-15629. IEEE, (2022)Multimodal Compact Bilinear Pooling for Visual Question Answering and Visual Grounding., , , , , and . EMNLP, page 457-468. The Association for Computational Linguistics, (2016)