Author of the publication

Sparse representation with temporal max-smoothing for acoustic event detection.

, , , , and . INTERSPEECH, page 1176-1180. ISCA, (2015)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Modeling spoken decision support dialogue and optimization of its dialogue strategy., , , , , , , and . ACM Trans. Speech Lang. Process., 7 (3): 10:1-10:18 (2011)Development of the "VoiceTra" Multi-Lingual Speech Translation System., , , , , , , , , and 1 other author(s). IEICE Trans. Inf. Syst., 100-D (4): 621-632 (2017)Constructing a Phonetic-Rich Speech Corpus While Controlling Time-Dependent Voice Quality Variability for English Speech Synthesis., , and . ICASSP (1), page 881-884. IEEE, (2006)Minimum segmentation error based discriminative training for speech synthesis application., , , and . ICASSP (1), page 629-632. IEEE, (2004)Discriminative training and explicit duration modeling for HMM-based automatic segmentation., , , and . Speech Commun., 47 (4): 397-410 (2005)Leveraging social Q&A collections for improving complex question answering., , , and . Comput. Speech Lang., 29 (1): 1-19 (2015)Unsupervised neural adaptation model based on optimal transport for spoken language identification., , , and . CoRR, (2020)Predicting and Attending to Damaging Collisions for Placing Everyday Objects in Photo-Realistic Simulations., , , , , , and . CoRR, (2021)Deep progressive multi-scale attention for acoustic event classification., , , , and . CoRR, (2019)CrossMap Transformer: A Crossmodal Masked Path Transformer Using Double Back-Translation for Vision-and-Language Navigation., , and . CoRR, (2021)