Author of the publication

Exploiting Audio-Visual Features with Pretrained AV-HuBERT for Multi-Modal Dysarthric Speech Reconstruction.

, , , , , , and . CoRR, (2024)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Any-to-Many Voice Conversion With Location-Relative Sequence-to-Sequence Modeling., , , , , and . IEEE ACM Trans. Audio Speech Lang. Process., (2021)Meta-Generalization for Domain-Invariant Speaker Verification., , , , , and . IEEE ACM Trans. Audio Speech Lang. Process., (2023)Neural Architecture Search for LF-MMI Trained Time Delay Neural Networks., , , , , , , , and . IEEE ACM Trans. Audio Speech Lang. Process., (2022)Hiformer: Sequence Modeling Networks With Hierarchical Attention Mechanisms., , , , , and . IEEE ACM Trans. Audio Speech Lang. Process., (2023)Automatic Speaker-level Pronunciation Assessment of L2 Speech Using Posterior Probabilities from Multiple Utterances., , , , , and . ISCSLP, page 1-5. IEEE, (2021)Unsupervised Cross-Lingual Speech Emotion Recognition Using Domain Adversarial Neural Network., , , , , and . ISCSLP, page 1-5. IEEE, (2021)Boosting the Performance of SpEx+ by Attention and Contextual Mechanism., , , , and . ISCSLP, page 135-139. IEEE, (2022)Towards Multi-Scale Style Control for Expressive Speech Synthesis., , , , , and . Interspeech, page 4673-4677. ISCA, (2021)Adversarially Learning Disentangled Speech Representations for Robust Multi-Factor Voice Conversion., , , , , and . Interspeech, page 846-850. ISCA, (2021)Spectro-Temporal Deep Features for Disordered Speech Assessment and Recognition., , , , , , , , and . Interspeech, page 4793-4797. ISCA, (2021)