Author of the publication

LSTP: Language-guided Spatial-Temporal Prompt Learning for Long-form Video-Text Understanding.

CoRR, (2024)

Other publications of authors with the same name

MoviePuzzle: Visual Narrative Reasoning through Multimodal Order Learning. CoRR, (2023)
Learning Triadic Belief Dynamics in Nonverbal Communication From Videos. CVPR, page 7312-7321. Computer Vision Foundation / IEEE, (2021)
Learning Descriptor Networks for 3D Shape Synthesis and Analysis. CVPR, page 8629-8638. Computer Vision Foundation / IEEE Computer Society, (2018)
MindAgent: Emergent Gaming Interaction. CoRR, (2023)
GRICE: A Grammar-based Dataset for Recovering Implicature and Conversational rEasoning. ACL/IJCNLP (Findings), volume ACL/IJCNLP 2021 of Findings of ACL, page 2074-2085. Association for Computational Linguistics, (2021)
Rethinking Dictionaries and Glyphs for Chinese Language Pre-training. ACL (Findings), page 1089-1101. Association for Computational Linguistics, (2023)
Towards More Realistic Chinese Spell Checking with New Benchmark and Specialized Expert Model. LREC/COLING, page 16570-16580. ELRA and ICCL, (2024)
RAM: Towards an Ever-Improving Memory System by Learning from Communications. CoRR, (2024)
Generative VoxelNet: Learning Energy-Based Models for 3D Shape Synthesis and Analysis. CoRR, (2020)
Energy-Based Generative Cooperative Saliency Prediction. CoRR, (2021)