Author of the publication

Non-Autoregressive End-to-End TTS with Coarse-to-Fine Decoding.

, , , , , and . INTERSPEECH, page 3984-3988. ISCA, (2020)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Distilling Knowledge Using Parallel Data for Far-field Speech Recognition., , , and . CoRR, (2018)Deep Time Delay Neural Network for Speech Enhancement with Full Data Learning., , , , , and . CoRR, (2020)TSNAT: Two-Step Non-Autoregressvie Transformer Models for Speech Recognition., , , , , , and . CoRR, (2021)VLP2MSA: Expanding vision-language pre-training to multimodal sentiment analysis., , , , , , , , and . Knowl. Based Syst., (January 2024)Efficient voice activity detection algorithm based on sub-band temporal envelope and sub-band long-term signal variability., , , , , and . ISCSLP, page 531-535. IEEE, (2014)Text Enhancement for Paragraph Processing in End-to-End Code-switching TTS., , , , , , and . ISCSLP, page 1-5. IEEE, (2021)Towards Fine-Grained Prosody Control for Voice Conversion., , , , and . ISCSLP, page 1-5. IEEE, (2021)A Time Delay Neural Network with Shared Weight Self-Attention for Small-Footprint Keyword Spotting., , , , , , and . INTERSPEECH, page 2190-2194. ISCA, (2019)Improving Deep Neural Network Based Speech Synthesis through Contextual Feature Parametrization and Multi-Task Learning., , , , and . J. Signal Process. Syst., 90 (7): 1025-1037 (2018)Investigating Efficient Feature Representation Methods and Training Objective for BLSTM-Based Phone Duration Prediction., , , , and . INTERSPEECH, page 784-788. ISCA, (2017)