Author of the publication

Improving What Cross-Modal Retrieval Models Learn through Object-Oriented Inter- and Intra-Modal Attention Networks.

, , , and . ICMR, page 244-252. ACM, (2019)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

AV-SUPERB: A Multi-Task Evaluation Benchmark for Audio-Visual Representation Models., , , , , , , , , and 9 other author(s). CoRR, (2023)Panoramic depth reconstruction within a single shot by optimizing global sphere radii., , , , and . SIGGRAPH ASIA Posters, page 80:1-80:2. ACM, (2018)Cognitive access in multichannel wireless networks using two-dimension Markov chain., , and . IWCMC, page 169-173. IEEE, (2014)MAViL: Masked Audio-Video Learners., , , , , , , , , and . CoRR, (2022)Informedia @ TRECVID 2018: Ad-hoc Video Search, Video to Text Description, Activities in Extended video., , , , , , , , , and 9 other author(s). TRECVID, National Institute of Standards and Technology (NIST), (2018)Improving What Cross-Modal Retrieval Models Learn through Object-Oriented Inter- and Intra-Modal Attention Networks., , , and . ICMR, page 244-252. ACM, (2019)Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles., , , , , , , , , and 3 other author(s). ICML, volume 202 of Proceedings of Machine Learning Research, page 29441-29454. PMLR, (2023)Generating Hashtags for Short-form Videos with Guided Signals., , , , , , , , and . ACL (1), page 9482-9495. Association for Computational Linguistics, (2023)Cognitive vertical handover in heterogeneous networks., , and . QSHINE, page 392-397. IEEE, (2015)RCAA: Relational Context-Aware Agents for Person Search., , , , , and . ECCV (9), volume 11213 of Lecture Notes in Computer Science, page 86-102. Springer, (2018)