Author of the publication

Large-scale multimodal semantic concept detection for consumer video.

, , , , , , and . Multimedia Information Retrieval, page 255-264. ACM, (2007)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Compressed-domain techniques for image/video indexing and manipulation.. ICIP, page 314-317. IEEE Computer Society, (1995)Local color and texture extraction and spatial query., and . ICIP (3), page 1011-1014. IEEE Computer Society, (1996)Multimodal Clustering Networks for Self-supervised Learning from Unlabeled Videos., , , , , , , , , and 3 other author(s). ICCV, page 7992-8001. IEEE, (2021)Learning with Partially Absorbing Random Walks., , , , and . NIPS, page 3086-3094. (2012)Language Models with Image Descriptors are Strong Few-Shot Video-Language Learners., , , , , , , , , and 3 other author(s). NeurIPS, (2022)Understanding ME? Multimodal Evaluation for Fine-grained Visual Commonsense., , , , , and . EMNLP, page 9212-9224. Association for Computational Linguistics, (2022)Weakly-Supervised Temporal Article Grounding., , , , , , , , and . EMNLP, page 9402-9413. Association for Computational Linguistics, (2022)UniFine: A Unified and Fine-grained Approach for Zero-shot Vision-Language Understanding., , , , , and . ACL (Findings), page 778-793. Association for Computational Linguistics, (2023)Non-Sequential Graph Script Induction via Multimedia Grounding., , , , , , and . ACL (1), page 5529-5545. Association for Computational Linguistics, (2023)A Multi-media Approach to Cross-lingual Entity Knowledge Transfer., , , , , and . ACL (1), The Association for Computer Linguistics, (2016)