Author of the publication


AV-TranSpeech: Audio-Visual Robust Speech-to-Speech Translation.

R. Huang, H. Liu, X. Cheng, Y. Ren, L. Li, Z. Ye, J. He, L. Zhang, J. Liu, X. Yin, and Z. Zhao. ACL (1), page 8590-8604. Association for Computational Linguistics, (2023)

Please choose a person to relate this publication to

To distinguish between persons with the same name, the academic degree and the title of an important publication are displayed. You can also use the button next to a name to display some publications already assigned to that person.

Rongjie Song

Josef Huang

Pa Huang

Haishi Huang

Feiqing Huang

Other publications of authors with the same name

Research on Dynamic Safe Loading Techniques in Android Application Protection System.
S. Cai, R. Huang, N. Yang, J. Jiang, Z. Ming, Z. Liang, and Z. Shan. SmartCom, volume 10699 of Lecture Notes in Computer Science, page 134-143. Springer, (2017)

FluentSpeech: Stutter-Oriented Automatic Speech Editing with Context-Aware Diffusion Models.
Z. Jiang, Q. Yang, J. Zuo, Z. Ye, R. Huang, Y. Ren, and Z. Zhao. ACL (Findings), page 11655-11671. Association for Computational Linguistics, (2023)

Multi-Singer: Fast Multi-Singer Singing Voice Vocoder With A Large-Scale Corpus.
R. Huang, F. Chen, Y. Ren, J. Liu, C. Cui, and Z. Zhao. ACM Multimedia, page 3945-3954. ACM, (2021)

StyleSinger: Style Transfer for Out-of-Domain Singing Voice Synthesis.
Y. Zhang, R. Huang, R. Li, J. He, Y. Xia, F. Chen, X. Duan, B. Huai, and Z. Zhao. CoRR, (2023)

AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head.
R. Huang, M. Li, D. Yang, J. Shi, X. Chang, Z. Ye, Y. Wu, Z. Hong, J. Huang, J. Liu and 4 other author(s). AAAI, page 23802-23804. AAAI Press, (2024)

UniAudio 1.5: Large Language Model-driven Audio Codec is A Few-shot Audio Task Learner.
D. Yang, H. Guo, Y. Wang, R. Huang, X. Li, X. Tan, X. Wu, and H. Meng. CoRR, (2024)

MEDIC: Zero-shot Music Editing with Disentangled Inversion Control.
H. Liu, J. Wang, R. Huang, Y. Liu, J. Xu, and Z. Zhao. CoRR, (2024)

FreeBind: Free Lunch in Unified Multimodal Space via Knowledge Fusion.
Z. Wang, Z. Zhang, X. Cheng, R. Huang, L. Liu, Z. Ye, H. Huang, Y. Zhao, T. Jin, P. Gao and 1 other author(s). ICML, OpenReview.net, (2024)

Robust Singing Voice Transcription Serves Synthesis.
R. Li, Y. Zhang, Y. Wang, Z. Hong, R. Huang, and Z. Zhao. ACL (1), page 9751-9766. Association for Computational Linguistics, (2024)

Lumina-Next: Making Lumina-T2X Stronger and Faster with Next-DiT.
L. Zhuo, R. Du, H. Xiao, Y. Li, D. Liu, R. Huang, W. Liu, L. Zhao, F. Wang, Z. Ma and 12 other author(s). CoRR, (2024)

BibSonomy

Disambiguation of "Huang, Rongjie"
