Author of the publication

MTGAT: Multimodal Temporal Graph Attention Networks for Unaligned Human Multimodal Language Sequences.

, , , , , , , and . CoRR, (2020)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Combining Active and Semi-Supervised Learning for Homograph Disambiguation in Mandarin Text-to-Speech Synthesis., , , and . INTERSPEECH, page 2165-2168. ISCA, (2011)What Gives the Answer Away? Question Answering Bias Analysis on Video QA Datasets., , , , , and . CoRR, (2020)MTAG: Modal-Temporal Attention Graph for Unaligned Human Multimodal Language Sequences., , , , , , , and . NAACL-HLT, page 1009-1021. Association for Computational Linguistics, (2021)High-Dimensional Sparse Cross-Modal Hashing with Fine-Grained Similarity Embedding., , , and . WWW, page 2900-2909. ACM / IW3C2, (2021)A vision transformer for fine-grained classification by reducing noise and enhancing discriminative information., , , , and . Pattern Recognit., (January 2024)Pixel Invisibility: Detecting Objects Invisible in Color Images., and . CoRR, (2020)Graph Neural Networks for 3D Multi-Object Tracking., , , and . CoRR, (2020)Unsupervised and Semi-supervised Bias Benchmarking in Face Recognition., , , , and . ECCV (13), volume 13673 of Lecture Notes in Computer Science, page 289-306. Springer, (2022)Connecting Gaze, Scene, and Attention: Generalized Attention Estimation via Joint Modeling of Gaze and Scene Saliency., , , , , and . ECCV (5), volume 11209 of Lecture Notes in Computer Science, page 397-412. Springer, (2018)A Two-Step Cross-Modal Hashing by Exploiting Label Correlations and Preserving Similarity in Both Steps., , , , , and . ACM Multimedia, page 1694-1702. ACM, (2019)