Author of the publication

Large-scale multimodal semantic concept detection for consumer video.

, , , , , , and . Multimedia Information Retrieval, page 255-264. ACM, (2007)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Measuring Female Representation and Impact in Films over Time., , and . Trans. Data Sci., 1 (4): 30:1-30:14 (2020)CLIP-ViP: Adapting Pre-trained Image-Text Model to Video-Language Representation Alignment., , , , , , and . CoRR, (2022)Exploiting Informative Video Segments for Temporal Action Localization., , , , and . IEEE Trans. Multim., (2022)Adaptive Siamese Tracking with a Compact Latent Network., , , , and . CoRR, (2023)End-to-end Multi-Modal Multi-Task Vehicle Control for Self-Driving Cars with Visual Perception., , , , and . CoRR, (2018)Learning Sparse 2D Temporal Adjacent Networks for Temporal Action Localization., , , , and . CoRR, (2019)Semantics-Aware Spatial-Temporal Binaries for Cross-Modal Video Retrieval., , , , and . IEEE Trans. Image Process., (2021)Interactively Co-segmentating Topically Related Images with Intelligent Scribble Guidance., , , , and . Int. J. Comput. Vis., 93 (3): 273-292 (2011)Determining Code Words in Euphemistic Hate Speech Using Word Embedding Networks., and . ALW, page 93-100. Association for Computational Linguistics, (2018)PromptCap: Prompt-Guided Image Captioning for VQA with GPT-3., , , , , and . ICCV, page 2951-2963. IEEE, (2023)