Author of the publication

AV-TranSpeech: Audio-Visual Robust Speech-to-Speech Translation.

ACL (1), pages 8590-8604. Association for Computational Linguistics, (2023)

Other publications of authors with the same name

ViT-TTS: Visual Text-to-Speech with Scalable Diffusion Transformer. EMNLP, pages 15957-15969. Association for Computational Linguistics, (2023)
AntCritic: Argument Mining for Free-Form and Visually-Rich Financial Comments. LREC/COLING, pages 1306-1317. ELRA and ICCL, (2024)
MEDIC: Zero-shot Music Editing with Disentangled Inversion Control. CoRR, (2024)
Wav2SQL: Direct Generalizable Speech-To-SQL Parsing. ACL (Findings), pages 4230-4242. Association for Computational Linguistics, (2024)
Wav2SQL: Direct Generalizable Speech-To-SQL Parsing. CoRR, (2023)
RMSSinger: Realistic-Music-Score based Singing Voice Synthesis. ACL (Findings), pages 236-248. Association for Computational Linguistics, (2023)
TranSpeech: Speech-to-Speech Translation With Bilateral Perturbation. ICLR, OpenReview.net, (2023)
ProDiff: Progressive Fast Diffusion Model for High-Quality Text-to-Speech. ACM Multimedia, pages 2595-2605. ACM, (2022)
AudioLCM: Text-to-Audio Generation with Latent Consistency Models. CoRR, (2024)
AV-TranSpeech: Audio-Visual Robust Speech-to-Speech Translation. ACL (1), pages 8590-8604. Association for Computational Linguistics, (2023)