Author of the publication

Flickr30k Entities: Collecting Region-to-Phrase Correspondences for Richer Image-to-Sentence Models.

, , , , , and . ICCV, page 2641-2649. IEEE Computer Society, (2015)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Phrase Localization and Visual Relationship Detection with Comprehensive Linguistic Cues., , , , and . CoRR, (2016)One-Shot Stylization for Full-Body Human Images., and . CoRR, (2023)Towards Open-Universe Image Parsing with Broad Coverage., and . MVA, page 13-20. (2013)Combining Multiple Cues for Visual Madlibs Question Answering., , , , , and . Int. J. Comput. Vis., 127 (1): 38-60 (2019)Modeling and Recognition of Landmark Image Collections Using Iconic Scene Graphs., , , and . Int. J. Comput. Vis., 95 (3): 213-239 (2011)Shadows Don't Lie and Lines Can't Bend! Generative Models don't know Projective Geometry...for now., , , , , and . CoRR, (2023)Revisiting Image-Language Networks for Open-Ended Phrase Detection., , , , , , and . IEEE Trans. Pattern Anal. Mach. Intell., 44 (4): 2155-2167 (2022)GridToPix: Training Embodied Agents with Minimal Supervision., , , , , and . ICCV, page 15121-15131. IEEE, (2021)Out of the Box: Reasoning with Graph Convolution Nets for Factual Visual Question Answering., , and . NeurIPS, page 2659-2670. (2018)Solving VIsual Madlibs with Multiple Cues., , , , , and . BMVC, BMVA Press, (2016)