Author of the publication

Learning to Speak from Text: Zero-Shot Multilingual Text-to-Speech with Unsupervised Text Pretraining.

IJCAI, page 5179-5187. ijcai.org, (2023)


Other publications of authors with the same name

Non-stationary noise estimation method based on bias-residual component decomposition for robust speech recognition. ICASSP, page 4816-4819. IEEE, (2011)

Weakly-Supervised Deep Recurrent Neural Networks for Basic Dance Step Generation. IJCNN, page 1-8. IEEE, (2019)

An analysis of environment, microphone and data simulation mismatches in robust speech recognition. Comput. Speech Lang., (2017)

Beamforming networks using spatial covariance features for far-field speech recognition. APSIPA, page 1-6. IEEE, (2016)

Segment-Level Vectorized Beam Search Based on Partially Autoregressive Inference. ASRU, page 1-8. IEEE, (2023)

Domain Adaptation by Data Distribution Matching Via Submodularity For Speech Recognition. ASRU, page 1-7. IEEE, (2023)

Fast-MD: Fast Multi-Decoder End-to-End Speech Translation with Non-Autoregressive Hidden Intermediates. ASRU, page 922-929. IEEE, (2021)

Towards Neural Diarization for Unlimited Numbers of Speakers Using Global and Local Attractors. ASRU, page 98-105. IEEE, (2021)

Toward Universal Speech Enhancement For Diverse Input Conditions. ASRU, page 1-6. IEEE, (2023)

Joint Prediction and Denoising for Large-Scale Multilingual Self-Supervised Learning. ASRU, page 1-8. IEEE, (2023)