Author of the publication

SC-CNN: Effective Speaker Conditioning Method for Zero-Shot Multi-Speaker Text-to-Speech Systems.

IEEE Signal Process. Lett., (2023)


Other publications of authors with the same name

Linear Prediction-based Parallel WaveGAN Speech Synthesis. ICEIC, page 1-4. IEEE, (2022)

Language Model-Based Emotion Prediction Methods for Emotional Speech Synthesis Systems. INTERSPEECH, page 4596-4600. ISCA, (2022)

Multi-SpectroGAN: High-Diversity and High-Fidelity Spectrogram Generation with Adversarial Style Combination for Speech Synthesis. AAAI, page 13198-13206. AAAI Press, (2021)

SC-CNN: Effective Speaker Conditioning Method for Zero-Shot Multi-Speaker Text-to-Speech Systems. IEEE Signal Process. Lett., (2023)

Pruning Self-Attention for Zero-Shot Multi-Speaker Text-to-Speech. CoRR, (2023)

Cross-Speaker Emotion Transfer for Low-Resource Text-to-Speech Using Non-Parallel Voice Conversion with Pitch-Shift Data Augmentation. INTERSPEECH, page 3018-3022. ISCA, (2022)

Audio Dequantization for High Fidelity Audio Generation in Flow-Based Neural Vocoder. INTERSPEECH, page 3545-3549. ISCA, (2020)

Cross-Lingual Transfer Learning for Phrase Break Prediction with Multilingual Language Model. CoRR, (2023)

TTS-by-TTS 2: Data-Selective Augmentation for Neural Speech Synthesis Using Ranking Support Vector Machine with Variational Autoencoder. INTERSPEECH, page 1941-1945. ISCA, (2022)