Author of the publication

Bi-Level Speaker Supervision for One-Shot Speech Synthesis.

, , , , , and . INTERSPEECH, page 3989-3993. ISCA, (2020)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Rnn-transducer with language bias for end-to-end Mandarin-English code-switching speech recognition., , , , and . CoRR, (2020)Emotion Selectable End-to-End Text-based Speech Editing., , , , , and . CoRR, (2022)Deep Attention Fusion Feature for Speech Separation with End-to-End Post-filter Method., , , , , and . CoRR, (2020)Gated Recurrent Fusion with Joint Training Framework for Robust End-to-End Speech Recognition., , , , , and . CoRR, (2020)Decoupling Pronunciation and Language for End-to-end Code-switching Automatic Speech Recognition., , , , , and . CoRR, (2020)Detection of Cross-Dataset Fake Audio Based on Prosodic and Pronunciation Features., , , , , and . CoRR, (2023)EmoFake: An Initial Dataset for Emotion Fake Audio Detection., , , , , , and . CoRR, (2022)SceneFake: An Initial Dataset and Benchmarks for Scene Fake Audio Detection., , , , , , and . CoRR, (2022)Prosody and Voice Factorization for Few-Shot Speaker Adaptation in the Challenge M2voc 2021., , , , , , and . ICASSP, page 8603-8607. IEEE, (2021)Patnet : A Phoneme-Level Autoregressive Transformer Network for Speech Synthesis., , , , and . ICASSP, page 5684-5688. IEEE, (2021)