Author of the publication

Prosody Aware Word-Level Encoder Based on BLSTM-RNNs for DNN-Based Speech Synthesis.

INTERSPEECH, pages 764-768. ISCA, (2017)


Other publications of authors with the same name

Neural Confnet Classification: Fully Neural Network Based Spoken Utterance Classification Using Word Confusion Networks. ICASSP, pages 6039-6043. IEEE, (2018)

Similar Speaker Selection Technique Based on Distance Metric Learning with Perceptual Voice Quality Similarity. INTERSPEECH, pages 1997-2000. ISCA, (2012)

Predicting VQVAE-based Character Acting Style from Quotation-Annotated Text for Audiobook Speech Synthesis. INTERSPEECH, pages 4551-4555. ISCA, (2022)

DNN-SPACE: DNN-HMM-Based Generative Model of Voice F0 Contours for Statistical Phrase/Accent Command Estimation. INTERSPEECH, pages 1074-1078. ISCA, (2017)

Investigating Effective Additional Contextual Factors in DNN-Based Spontaneous Speech Synthesis. INTERSPEECH, pages 3201-3205. ISCA, (2020)

Enhancement of Text-Predicting Style Token With Generative Adversarial Network for Expressive Speech Synthesis. ICASSP, pages 1-5. IEEE, (2023)

Multi-Sample Subband Wavernn Via Multivariate Gaussian. ICASSP, pages 8427-8431. IEEE, (2022)

Robust Speech-Age Estimation Using Local Maximum Mean Discrepancy Under Mismatched Recording Conditions. ASRU, pages 114-121. IEEE, (2021)

DNN-based Speech Synthesis Using Abundant Tags of Spontaneous Speech Corpus. LREC, pages 6438-6443. European Language Resources Association, (2020)

Impact of Emotional State on Estimation of Willingness to Buy from Advertising Speech. INTERSPEECH, pages 2486-2490. ISCA, (2021)