Author of the publication

Learning auxiliary categorical information for speech synthesis based on deep and recurrent neural networks.

, , , , and . ISCSLP, page 1-5. IEEE, (2016)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Synchronous Transformers for End-to-End Speech Recognition., , , , , and . CoRR, (2019)Noise Prior Knowledge Learning for Speech Enhancement via Gated Convolutional Generative Adversarial Network., , , , , and . APSIPA, page 662-666. IEEE, (2019)Which Phonemes Will Distinguish the Different Regions Within the Same Dialect?, , , , , and . O-COCOSDA, page 152-157. IEEE, (2021)Hybrid Multi-Task Learning for End-To-End Multimodal Emotion Recognition., , , , , and . APSIPA ASC, page 1966-1971. IEEE, (2023)Learning From Yourself: A Self-Distillation Method For Fake Speech Detection., , , , , , and . ICASSP, page 1-5. IEEE, (2023)Recurrent Neural Network Based Language Model Adaptation for Accent Mandarin Speech., , , and . CCPR (2), volume 663 of Communications in Computer and Information Science, page 607-617. (2016)Forward-Backward Decoding for Regularizing End-to-End TTS., , , , , , and . INTERSPEECH, page 1283-1287. ISCA, (2019)Joint Training for Simultaneous Speech Denoising and Dereverberation with Deep Embedding Representations., , , , and . INTERSPEECH, page 4536-4540. ISCA, (2020)BLSTM-CRF Based End-to-End Prosodic Boundary Prediction with Context Sensitive Embeddings in a Text-to-Speech Front-End., , , and . INTERSPEECH, page 47-51. ISCA, (2018)Dynamic Speaker Representations Adjustment and Decoder Factorization for Speaker Adaptation in End-to-End Speech Synthesis., , , , , and . INTERSPEECH, page 4701-4705. ISCA, (2020)