Author of the publication

Cross-Modal Transformer-Based Neural Correction Models for Automatic Speech Recognition.

, , , , , , , and . Interspeech, page 4059-4063. ISCA, (2021)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Independent Deeply Learned Matrix Analysis for Determined Audio Source Separation., , , , , , , and . IEEE ACM Trans. Audio Speech Lang. Process., 27 (10): 1601-1615 (2019)OnDA-DETR: Online Domain Adaptation for Detection Transformers with Self-Training Framework., , , , , and . ICIP, page 1780-1785. IEEE, (2023)MAPGN: Masked Pointer-Generator Network for Sequence-to-Sequence Pre-Training., , , , , and . ICASSP, page 7563-7567. IEEE, (2021)Independent deeply learned matrix analysis with automatic selection of stable microphone-wise update and fast sourcewise update of demixing matrix., , , , , , and . Signal Process., (2021)Enrollment-Less Training for Personalized Voice Activity Detection., , , , , and . Interspeech, page 346-350. ISCA, (2021)Memory Attentive Fusion: External Language Model Integration for Transformer-based Sequence-to-Sequence Model., , , , , and . INLG, page 1-6. Association for Computational Linguistics, (2020)Hierarchical Knowledge Distillation for Dialogue Sequence Labeling., , , , , , and . ASRU, page 433-440. IEEE, (2021)Unified Autoregressive Modeling for Joint End-to-End Multi-Talker Overlapped Speech Recognition and Speaker Attribute Estimation., , , , , , and . Interspeech, page 2591-2595. ISCA, (2021)Text-to-Text Pre-Training with Paraphrasing for Improving Transformer-Based Image Captioning., , , , , and . EUSIPCO, page 516-520. IEEE, (2023)Multi-region CNN-Transformer for Micro-gesture Recognition in Face and Upper Body., , , , and . MMAsia, page 89:1-89:5. ACM, (2023)