Author of the publication

Please choose a person to relate this publication to

To distinguish between persons with the same name, the academic degree and the title of an important publication are displayed. You can also use the button next to a name to display some of the publications already assigned to that person.

 

Other publications of authors with the same name

StyleSinger: Style Transfer for Out-of-Domain Singing Voice Synthesis. CoRR, (2023)
DualSign: Semi-Supervised Sign Language Production with Balanced Multi-Modal Multi-Task Dual Transformation. ACM Multimedia, page 5486-5495. ACM, (2022)
M4Singer: A Multi-Style, Multi-Singer and Musical Score Provided Mandarin Singing Corpus. NeurIPS, (2022)
GeneFace++: Generalized and Stable Real-Time Audio-Driven 3D Talking Face Generation. CoRR, (2023)
TranSpeech: Speech-to-Speech Translation With Bilateral Perturbation. CoRR, (2022)
Mega-TTS 2: Zero-Shot Text-to-Speech with Arbitrary Length Speech Prompts. CoRR, (2023)
StyleSinger: Style Transfer for Out-of-Domain Singing Voice Synthesis. AAAI, page 19597-19605. AAAI Press, (2024)
Flow-Based Unconstrained Lip to Speech Generation. AAAI, page 843-851. AAAI Press, (2022)
UniSinger: Unified End-to-End Singing Voice Synthesis With Cross-Modality Information Matching. ACM Multimedia, page 7569-7579. ACM, (2023)
ViT-TTS: Visual Text-to-Speech with Scalable Diffusion Transformer. EMNLP, page 15957-15969. Association for Computational Linguistics, (2023)