Author of the publication

PodCastle and songle: crowdsourcing-based web services for spoken content retrieval and active music listening.

, , , , , and . CrowdMM@ACM Multimedia, page 1-2. ACM, (2012)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

LyricSynchronizer: Automatic Synchronization System Between Musical Audio Signals and Lyrics., , , and . J. Sel. Topics Signal Processing, 5 (6): 1252-1261 (2011)Detection and Separation of Speech Event Using Audio and Video Information Fusion and Its Application to Robust Speech Interface., , , , , , , and . EURASIP J. Adv. Signal Process., 2004 (11): 1727-1738 (2004)Presentation sensei: a presentation training system using speech and image processing., , , , and . ICMI, page 358-365. ACM, (2007)English call system with functions of speech segmentation and pronunciation evaluation using speech recognition technology., and . INTERSPEECH, page 1229-1232. ISCA, (2002)Multi-Self-Supervised Learning Model-Based Throat Microphone Speech Recognition., , , and . APSIPA ASC, page 1766-1770. IEEE, (2023)PodCastle and Songle: Crowdsourcing-Based Web Services for Retrieval and Browsing of Speech and Music Content., , , , , and . CrowdSearch, volume 842 of CEUR Workshop Proceedings, page 36-41. CEUR-WS.org, (2012)Applying Generative Adversarial Networks and Vision Transformers in Speech Emotion Recognition., , , and . HCI (44), volume 13519 of Lecture Notes in Computer Science, page 67-75. Springer, (2022)Podcastle: collaborative training of acoustic models on the basis of wisdom of crowds for podcast transcription., and . INTERSPEECH, page 1491-1494. ISCA, (2009)Acoustic event detection for spotting "hot spots" in podcasts., , , and . INTERSPEECH, page 1143-1146. ISCA, (2009)Podcastle: a web 2.0 approach to speech recognition research., , and . INTERSPEECH, page 2397-2400. ISCA, (2007)