Author of the publication

Zero-Shot Multi-Speaker Text-To-Speech with State-Of-The-Art Neural Speaker Embeddings.

ICASSP, page 6184-6188. IEEE, (2020)


Other publications of authors with the same name

Measuring Uncertainty in Deep Regression Models: The Case of Age Estimation from Speech. ICASSP, page 4939-4943. IEEE, (2018)

Noise2Music: Text-conditioned Music Generation with Diffusion Models. CoRR, (2023)

Focus on the Present: A Regularization Method for the ASR Source-Target Attention Layer. ICASSP, page 5994-5998. IEEE, (2021)

How to Estimate Model Transferability of Pre-Trained Speech Models? INTERSPEECH, page 456-460. ISCA, (2023)

SLM: Bridge the Thin Gap Between Speech and Text Foundation Models. ASRU, page 1-8. IEEE, (2023)

The MIT Lincoln Laboratory / JHU / EPITA-LSE LRE17 System. Odyssey, page 54-59. ISCA, (2018)

Advances in Speaker Recognition for Telephone and Audio-Visual Data: the JHU-MIT Submission for NIST SRE19. Odyssey, page 273-280. ISCA, (2020)

Focus on the present: a regularization method for the ASR source-target attention layer. CoRR, (2020)

A Comparative Study on Non-Autoregressive Modelings for Speech-to-Text Generation. ASRU, page 47-54. IEEE, (2021)

WaveGrad: Estimating Gradients for Waveform Generation. ICLR, OpenReview.net, (2021)