Author of the publication

Scene-robust Natural Language Video Localization via Learning Domain-invariant Representations.

, , , , and . ACL (Findings), page 144-160. Association for Computational Linguistics, (2023)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Saliency based proposal refinement in robotic vision., , and . RCAR, page 85-90. IEEE, (2017)Video Question Answering via Knowledge-based Progressive Spatial-Temporal Attention Network., , , , , and . TOMM, 15 (2s): 52:1-52:22 (2019)Temporal Interaction and Causal Influence in Community-Based Question Answering., , , , , , and . IEEE Trans. Knowl. Data Eng., 29 (10): 2304-2317 (2017)Learning Max-Margin GeoSocial Multimedia Network Representations for Point-of-Interest Suggestion., , , , , , and . SIGIR, page 833-836. ACM, (2017)Video Dialog via Multi-Grained Convolutional Self-Attention Context Networks., , , , , and . SIGIR, page 465-474. ACM, (2019)Efficient location-based search of trajectories with location importance., , , and . Knowl. Inf. Syst., 45 (1): 215-245 (2015)TaoHighlight: Commodity-Aware Multi-Modal Video Highlight Detection in E-Commerce., , , , , and . IEEE Trans. Multim., (2022)Generation Method for Shaded Relief Based on Conditional Generative Adversarial Nets., , , , and . ISPRS Int. J. Geo Inf., 11 (7): 374 (2022)AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head., , , , , , , , , and 3 other author(s). CoRR, (2023)Frame-Subtitle Self-Supervision for Multi-Modal Video Question Answering., , and . CoRR, (2022)