Author of the publication

Unsupervised Learning of Disentangled Speech Content and Style Representation.

, , , and . Interspeech, page 4089-4093. ISCA, (2021)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions., , , , , , , , , and 3 other author(s). CoRR, (2017)Hierarchical Generative Modeling for Controllable Speech Synthesis., , , , , , , , , and 2 other author(s). CoRR, (2018)Vector-quantized Image Modeling with Improved VQGAN., , , , , , , , , and . CoRR, (2021)FastEmit: Low-latency Streaming ASR with Sequence-level Emission Regularization., , , , , , , , , and 1 other author(s). CoRR, (2020)Improving Streaming Automatic Speech Recognition with Non-Streaming Model Distillation on Unsupervised Data., , , , , , , , , and . ICASSP, page 6558-6562. IEEE, (2021)Scaling End-to-End Models for Large-Scale Multilingual ASR., , , , , , , , , and . ASRU, page 1011-1018. IEEE, (2021)MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training., , , , , , , , , and 22 other author(s). CoRR, (2024)Bridging the Gap Between Streaming and Non-Streaming ASR Systems by Distilling Ensembles of CTC and RNN-T Models., , , , , and . Interspeech, page 1807-1811. ISCA, (2021)EfficientDet: Scalable and Efficient Object Detection., , and . CoRR, (2019)A Better and Faster End-to-End Model for Streaming ASR., , , , , , , , , and 5 other author(s). CoRR, (2020)