Author of the publication

Multimodal Chain: Cross-Modal Collaboration Through Listening, Speaking, and Visualizing.

, , , and . IEEE Access, (2021)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Speech-to-Speech Translation Between Untranscribed Unknown Languages., , and . ASRU, page 593-600. IEEE, (2019)Sequence-to-Sequence ASR Optimization via Reinforcement Learning., , and . CoRR, (2017)Local Monotonic Attention Mechanism for End-to-End Speech Recognition., , and . CoRR, (2017)Multimodal Chain: Cross-Modal Collaboration Through Listening, Speaking, and Visualizing., , , and . IEEE Access, (2021)Generative Pre-training for Speech with Flow Matching., , , , , and . CoRR, (2023)Dynamic ASR Pathways: An Adaptive Masking Approach Towards Efficient Pruning of A Multilingual ASR Model., , , , , , , , , and . CoRR, (2023)Local Monotonic Attention Mechanism for End-to-End Speech And Language Processing., , and . IJCNLP(1), page 431-440. Asian Federation of Natural Language Processing, (2017)Sequence-to-Sequence Learning via Attention Transfer for Incremental Speech Recognition., , , and . INTERSPEECH, page 3835-3839. ISCA, (2019)NIX-TTS: Lightweight and End-to-End Text-to-Speech Via Module-Wise Distillation., , , , and . SLT, page 970-976. IEEE, (2022)Learning ASR Pathways: A Sparse Multilingual ASR Model., , , , , and . ICASSP, page 1-5. IEEE, (2023)