Author of the publication

Flickr30k Entities: Collecting Region-to-Phrase Correspondences for Richer Image-to-Sentence Models.

, , , , , and . ICCV, page 2641-2649. IEEE Computer Society, (2015)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Semantic Image Manipulation with Background-guided Internal Learning., , , , and . CoRR, (2022)Explaining Reinforcement Learning Policies through Counterfactual Trajectories., , , , , , and . CoRR, (2022)Give me a hint! Navigating Image Databases using Human-in-the-loop Feedback., , , and . CoRR, (2018)Learning Type-Aware Embeddings for Fashion Compatibility., , , , , and . ECCV (16), volume 11220 of Lecture Notes in Computer Science, page 405-421. Springer, (2018)Human Evaluation of Text-to-Image Models on a Multi-Task Benchmark., , , , , , , , , and 4 other author(s). CoRR, (2022)Socratis: Are large multimodal models emotionally aware?, , , , , and . CoRR, (2023)Language-Guided Audio-Visual Source Separation via Trimodal Consistency., , , , , , , and . CVPR, page 10575-10584. IEEE, (2023)MULE: Multimodal Universal Language Embedding., , , , and . AAAI, page 11254-11261. AAAI Press, (2020)Self-supervised Visual Attribute Learning for Fashion Compatibility., , , , , and . ICCVW, page 1057-1066. IEEE, (2021)Complex Scene Image Editing by Scene Graph Comprehension., , , , and . BMVC, page 451. BMVA Press, (2023)