Author of the publication

Align or attend? Toward More Efficient and Accurate Spoken Word Discovery Using Speech-to-Image Retrieval.

, , , , and . ICASSP, page 7603-7607. IEEE, (2021)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Optimal speech estimator considering room response as well as additive noise: Different approaches in low and high frequency range., and . ICASSP, page 4573-4576. IEEE, (2008)Training Spoken Language Understanding Systems with Non-Parallel Speech and Text., , and . ICASSP, page 8109-8113. IEEE, (2020)Language coverage for mismatched crowdsourcing., , and . ITA, page 1-9. IEEE, (2016)A Novel Vector Representation of Stochastic Signals Based on Adapted Ergodic HMMs., , and . IEEE Signal Process. Lett., 17 (8): 715-718 (2010)Semantic analysis for a speech user interface in an intelligent tutoring system., , and . IUI, page 313-315. ACM, (2004)Detecting interaction links in a collaborating group using manually annotated data., , , , and . Soc. Networks, 34 (4): 515-526 (2012)Automatic detection of auditory salience with optimized linear filters derived from human annotation., , , , and . Pattern Recognit. Lett., (2014)Dual-path Attention is All You Need for Audio-Visual Speech Extraction., , and . CoRR, (2022)Unsupervised Speech Recognition with N-Skipgram and Positional Unigram Matching., , and . CoRR, (2023)Seeing is Knowing! Fact-based Visual Question Answering using Knowledge Graph Embeddings., and . CoRR, (2020)