Author of the publication

Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering.

, , , , , , and . CVPR, page 6077-6086. Computer Vision Foundation / IEEE Computer Society, (2018)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

New methods and evaluation experiments on translating TED talks in the IWSLT benchmark., , , , and . ICASSP, page 4945-4948. IEEE, (2012)An efficient confusing choices decoupling framework for multi-choice tasks over texts., , , , , , and . Neural Comput. Appl., 36 (1): 259-271 (January 2024)Direction of Arrival Estimation Based on the Multistage Nested Wiener Filter., and . IJDSN, (2015)Tips and Tricks for Visual Question Answering: Learnings from the 2017 Challenge., , , and . CoRR, (2017)Multiple-Kernel Based Vehicle Tracking Using 3D Deformable Model and Camera Self-Calibration., , , , , , , and . CoRR, (2017)DR-GAN: Conditional Generative Adversarial Network for Fine-Grained Lesion Synthesis on Diabetic Retinopathy Images., , , , , , and . CoRR, (2019)The practice of speech and language processing in China., , , , , and . Commun. ACM, 64 (11): 81-87 (2021)Hierarchically Structured Reinforcement Learning for Topically Coherent Visual Story Generation., , , , , and . CoRR, (2018)MaskedSpeech: Context-aware Speech Synthesis with Masking Strategy., , , , , and . CoRR, (2022)UFO2: A unified pre-training framework for online and offline speech recognition., , , , , , , and . CoRR, (2022)