Author of the publication

Audio-Visual End-to-End Multi-Channel Speech Separation, Dereverberation and Recognition.

, , , , , , , , and . IEEE ACM Trans. Audio Speech Lang. Process., (2023)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Deep segmental phonetic posterior-grams based discovery of non-categories in L2 English speech., , , and . CoRR, (2020)The development of the cambridge university alignment systems for the multi-genre broadcast challenge., , , , , , , and . ASRU, page 647-653. IEEE, (2015)Language model cross adaptation for LVCSR system combination., , and . Comput. Speech Lang., 27 (4): 928-942 (2013)Use of contexts in language model interpolation and adaptation., , and . Comput. Speech Lang., 27 (1): 301-321 (2013)Automatic Complexity Control of Generalized Variable Parameter HMMs for Noise Robust Speech Recognition., , and . IEEE ACM Trans. Audio Speech Lang. Process., 23 (1): 102-114 (2015)Investigation of Data Augmentation Techniques for Disordered Speech Recognition., , , , , , and . CoRR, (2022)A Multitask Learning Framework for Speaker Change Detection with Content Information from Unsupervised Speech Decomposition., , , , , , and . ICASSP, page 8087-8091. IEEE, (2022)Mixed Precision DNN Quantization for Overlapped Speech Separation and Recognition., , , and . ICASSP, page 7297-7301. IEEE, (2022)A Hierarchical Regression Chain Framework for Affective Vocal Burst Recognition., , , , , and . ICASSP, page 1-5. IEEE, (2023)Fcl-Taco2: Towards Fast, Controllable and Lightweight Text-to-Speech Synthesis., , , , , , , and . ICASSP, page 5714-5718. IEEE, (2021)