Author of the publication

ESPnet-SE++: Speech Enhancement for Robust Speech Recognition, Translation, and Understanding.

, , , , , , , , , , , , and . INTERSPEECH, page 5458-5462. ISCA, (2022)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

A Comparative Study on E-Branchformer vs Conformer in Speech Recognition, Translation, and Understanding Tasks., , , , , , , , , and . CoRR, (2023)Avoid Overthinking in Self-Supervised Models for Speech Recognition., , and . CoRR, (2022)Prompting the Hidden Talent of Web-Scale Speech Models for Zero-Shot Task Generalization., , , and . CoRR, (2023)4D ASR: Joint modeling of CTC, Attention, Transducer, and Mask-Predict decoders., , , , and . CoRR, (2022)Software Design and User Interface of ESPnet-SE++: Speech Enhancement for Robust Speech Processing., , , , , , , , , and 3 other author(s). J. Open Source Softw., 8 (91): 5403 (November 2023)Integrating Pretrained ASR and LM to Perform Sequence Generation for Spoken Language Understanding., , , , , and . CoRR, (2023)Align, Write, Re-Order: Explainable End-to-End Speech Translation via Operation Sequence Generation., , , , and . ICASSP, page 1-5. IEEE, (2023)Towards Zero-Shot Code-Switched Speech Recognition., , , , and . ICASSP, page 1-5. IEEE, (2023)Combining Spectral and Self-Supervised Features for Low Resource Speech Recognition and Translation., , , , , and . INTERSPEECH, page 3533-3537. ISCA, (2022)Differentiable Allophone Graphs for Language-Universal Speech Recognition., , , , and . Interspeech, page 2471-2475. ISCA, (2021)