Author of the publication

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Bi-Level Style and Prosody Decoupling Modeling for Personalized End-to-End Speech Synthesis., , , , , and . ICASSP, page 6568-6572. IEEE, (2021)Prosody and Voice Factorization for Few-Shot Speaker Adaptation in the Challenge M2voc 2021., , , , , , and . ICASSP, page 8603-8607. IEEE, (2021)Minimally-Supervised Speech Synthesis with Conditional Diffusion Model and Language Model: A Comparative Study of Semantic Coding., , , , , , , and . CoRR, (2023)Dynamic Soft Windowing and Language Dependent Style Token for Code-Switching End-to-End Speech Synthesis., , , , , and . INTERSPEECH, page 2937-2941. ISCA, (2020)Text Enhancement for Paragraph Processing in End-to-End Code-switching TTS., , , , , , and . CoRR, (2022)Back-Translation-Style Data Augmentation for Mandarin Chinese Polyphone Disambiguation., , , , , and . CoRR, (2022)Style-Label-Free: Cross-Speaker Style Transfer by Quantized VAE and Speaker-wise Normalization in Speech Synthesis., , , , and . ISCSLP, page 61-65. IEEE, (2022)Improving Prosody for Cross-Speaker Style Transfer by Semi-Supervised Style Extractor and Hierarchical Modeling in Speech Synthesis., , , , , and . ICASSP, page 1-5. IEEE, (2023)High-Fidelity Speech Synthesis with Minimal Supervision: All Using Diffusion Models., , , , , , and . CoRR, (2023)Learning Speech Representation From Contrastive Token-Acoustic Pretraining., , , , , , and . CoRR, (2023)