Author of the publication

FastDiff 2: Revisiting and Incorporating GANs and Diffusion Models in High-Fidelity Speech Synthesis.

, , , , , and . ACL (Findings), page 6994-7009. Association for Computational Linguistics, (2023)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

FluentSpeech: Stutter-Oriented Automatic Speech Editing with Context-Aware Diffusion Models., , , , , , and . ACL (Findings), page 11655-11671. Association for Computational Linguistics, (2023)A Robotic Communication Middleware Combining High Performance and High Reliability., , , , and . SBAC-PAD, page 217-224. IEEE, (2020)Zero-shot Explainable Mental Health Analysis on Social Media by Incorporating Mental Scales., , , , , and . WWW (Companion Volume), page 959-962. ACM, (2024)Zoro: A robotic middleware combining high performance and high reliability., , , , , and . J. Parallel Distributed Comput., (2022)Mega-TTS: Zero-Shot Text-to-Speech at Scale with Intrinsic Inductive Bias., , , , , , , , , and 2 other author(s). CoRR, (2023)Ada-TTA: Towards Adaptive High-Quality Text-to-Talking Avatar Synthesis., , , , , , , and . CoRR, (2023)Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis., , , , , , , , , and 4 other author(s). CoRR, (2024)Dict-TTS: Learning to Pronounce with Prior Dictionary Knowledge for Text-to-Speech., , , , , , and . CoRR, (2022)FluentEditor: Text-based Speech Editing by Considering Acoustic and Prosody Consistency., , , and . CoRR, (2023)GeneFace: Generalized and High-Fidelity Audio-Driven 3D Talking Face Synthesis., , , , , and . ICLR, OpenReview.net, (2023)