Author of the publication

Talk, Don't Write: A Study of Direct Speech-Based Image Retrieval.

, , and . Interspeech, page 2976-2980. ISCA, (2021)

Other publications of authors with the same name

GRADE: Machine Learning Support for Graduate Admissions., and . IAAI, page 1479-1486. AAAI, (2013) 978-1-57735-615-8.
Talk, Don't Write: A Study of Direct Speech-Based Image Retrieval., , and . Interspeech, page 2976-2980. ISCA, (2021)
PaLI-X: On Scaling up a Multilingual Vision and Language Model., , , , , , , , , and 33 other author(s). CoRR, (2023)
Spherical Topic Models., , , and . ICML, page 903-910. Omnipress, (2010)
A New Path: Scaling Vision-and-Language Navigation with Synthetic Instructions and Imitation Learning., , , , , , , , and . CVPR, page 10813-10823. IEEE, (2023)
Crisscrossed Captions: Extended Intramodal and Intermodal Semantic Similarity Judgments for MS-COCO., , , , and . CoRR, (2020)
Less is More: Generating Grounded Navigation Instructions from Landmarks., , , , , , , , , and . CVPR, page 15407-15417. IEEE, (2022)
Leveraging Language ID in Multilingual End-to-End Speech Recognition., , , , and . ASRU, page 928-935. IEEE, (2019)
Distilling Knowledge from Ensembles of Neural Networks for Speech Recognition., and . INTERSPEECH, page 3439-3443. ISCA, (2016)
Greedy Growing Enables High-Resolution Pixel-Based Diffusion Models., , , , , , , , , and 15 other author(s). CoRR, (2024)