Multimodal video search techniques: late fusion of speech-based retrieval and visual content-based retrieval.

ICASSP (3), page 1048-1051. IEEE, (2004)

Other publications of authors with the same name

Audio-Visual Speaker Recognition for Video Broadcast News. VLSI Signal Processing, 29 (1-2): 71-79 (2001)

User-trainable video annotation using multimodal cues. SIGIR, page 403-404. ACM, (2003)

Perceptual interfaces for information interaction: joint processing of audio and visual information for human-computer interaction. INTERSPEECH, page 11-14. ISCA, (2000)

Towards speech understanding across multiple languages. ICSLP, ISCA, (1998)

Weighting schemes for audio-visual fusion in speech recognition. ICASSP, page 173-176. IEEE, (2001)

Joint audio-visual speech processing for recognition and enhancement. AVSP, page 95-104. ISCA, (2003)

On the use of visual information for improving audio-based speaker recognition. AVSP, page 18. ISCA, (1999)

Translingual Visual Speech Synthesis. IEEE International Conference on Multimedia and Expo (II), page 1089-1092. IEEE Computer Society, (2000)

A real-time prototype for small-vocabulary audio-visual ASR. ICME, page 469-472. IEEE Computer Society, (2003)

Assessing face and speech consistency for monologue detection in video. ACM Multimedia, page 303-306. ACM, (2002)