Author of the publication

Talk, Don't Write: A Study of Direct Speech-Based Image Retrieval.

Interspeech, pages 2976-2980. ISCA, (2021)

Other publications of authors with the same name

Multimodal Grounding for Sequence-to-sequence Speech Recognition. ICASSP, pages 8648-8652. IEEE, (2019)
The Edinburgh International Accents of English Corpus: Towards the Democratization of English ASR. ICASSP, pages 1-5. IEEE, (2023)
Analyzing Acoustic Word Embeddings from Pre-Trained Self-Supervised Speech Models. ICASSP, pages 1-5. IEEE, (2023)
Transforming LLMs into Cross-modal and Cross-lingual Retrieval Systems. CoRR, (2024)
Grounding Object Detections With Transcriptions. CoRR, (2019)
On the Difficulty of Segmenting Words with Attention. CoRR, (2021)
Talk, Don't Write: A Study of Direct Speech-Based Image Retrieval. Interspeech, pages 2976-2980. ISCA, (2021)
Transfer learning for multimodal dialog. Comput. Speech Lang., (2020)
OPERA: Operations-oriented Probabilistic Extraction, Reasoning, and Analysis. TAC, NIST, (2019)
Grounded Sequence to Sequence Transduction. IEEE J. Sel. Top. Signal Process., 14 (3): 577-591 (2020)