Author of the publication

CALM: Contrastive Cross-modal Speaking Style Modeling for Expressive Text-to-Speech Synthesis.

, , , , , , , , and . CoRR, (2023)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Relational Graph Reasoning Transformer for Image Captioning., , , and . ICME, page 1-6. IEEE, (2022)Improved Multiple-Image-Based Reflection Removal Algorithm Using Deep Neural Networks., , and . CoRR, (2022)CALM: Contrastive Cross-modal Speaking Style Modeling for Expressive Text-to-Speech Synthesis., , , , , , , , and . CoRR, (2023)Improved Multiple-Image-Based Reflection Removal Algorithm Using Deep Neural Networks., , and . IEEE Trans. Image Process., (2021)CALM: Constrastive Cross-modal Speaking Style Modeling for Expressive Text-to-Speech Synthesis., , , , , , , , and . INTERSPEECH, page 5533-5537. ISCA, (2022)Image Reflection Removal Using the Wasserstein Generative Adversarial Network., and . ICASSP, page 7695-7699. IEEE, (2019)Single-Image Reflection Removal via a Two-Stage Background Recovery Process., and . IEEE Signal Process. Lett., 26 (8): 1237-1241 (2019)Deep Music Retrieval for Fine-Grained Videos by Exploiting Cross-Modal-Encoded Voice-Overs., , , , , , , and . SIGIR, page 1880-1884. ACM, (2021)Group-Skeleton-Based Human Action Recognition in Complex Events., , and . ACM Multimedia, page 4703-4707. ACM, (2020)Super-resolution imaging with occlusion removal using a camera array., and . ISCAS, page 2487-2490. IEEE, (2016)