Author of the publication

Improved Audio Embeddings by Adjacency-Based Clustering with Applications in Spoken Term Detection.

CoRR, (2018)

Other publications of authors with the same name

Non-autoregressive Mandarin-English Code-switching Speech Recognition with Pinyin Mask-CTC and Word Embedding Regularization. CoRR, (2021)
Maximizing Data Efficiency for Cross-Lingual TTS Adaptation by Self-Supervised Representation Mixing and Embedding Initialization. CoRR, (2024)
Non-Autoregressive Mandarin-English Code-Switching Speech Recognition. ASRU, pages 465-472. IEEE, (2021)
Phonetic-and-Semantic Embedding of Spoken words with Applications in Spoken Content Retrieval. SLT, pages 941-948. IEEE, (2018)
SpeechNet: A Universal Modularized Model for Speech Processing Tasks. CoRR, (2021)
Almost-unsupervised Speech Recognition with Close-to-zero Resource Based on Phonetic Structures Learned from Very Small Unpaired Speech and Text Data. CoRR, (2018)
Pretrained Language Model Embryology: The Birth of ALBERT. EMNLP (1), pages 6813-6828. Association for Computational Linguistics, (2020)
Improved Audio Embeddings by Adjacency-Based Clustering with Applications in Spoken Term Detection. CoRR, (2018)
Meta-TTS: Meta-Learning for Few-Shot Speaker Adaptive Text-to-Speech. CoRR, (2021)
Few Shot Cross-Lingual TTS Using Transferable Phoneme Embedding. INTERSPEECH, pages 4566-4570. ISCA, (2022)