Author of the publication

AV-TranSpeech: Audio-Visual Robust Speech-to-Speech Translation.

ACL (1), page 8590-8604. Association for Computational Linguistics, (2023)

Other publications of authors with the same name

Research on Dynamic Safe Loading Techniques in Android Application Protection System. SmartCom, volume 10699 of Lecture Notes in Computer Science, page 134-143. Springer, (2017)
FluentSpeech: Stutter-Oriented Automatic Speech Editing with Context-Aware Diffusion Models. ACL (Findings), page 11655-11671. Association for Computational Linguistics, (2023)
Multi-Singer: Fast Multi-Singer Singing Voice Vocoder With A Large-Scale Corpus. ACM Multimedia, page 3945-3954. ACM, (2021)
StyleSinger: Style Transfer for Out-of-Domain Singing Voice Synthesis. CoRR, (2023)
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head. AAAI, page 23802-23804. AAAI Press, (2024)
UniAudio 1.5: Large Language Model-driven Audio Codec is A Few-shot Audio Task Learner. CoRR, (2024)
MEDIC: Zero-shot Music Editing with Disentangled Inversion Control. CoRR, (2024)
FreeBind: Free Lunch in Unified Multimodal Space via Knowledge Fusion. ICML, OpenReview.net, (2024)
Robust Singing Voice Transcription Serves Synthesis. ACL (1), page 9751-9766. Association for Computational Linguistics, (2024)
Lumina-Next: Making Lumina-T2X Stronger and Faster with Next-DiT. CoRR, (2024)