Author of the publication

VCSE: Time-Domain Visual-Contextual Speaker Extraction Network.

, , , , and . INTERSPEECH, page 906-910. ISCA, (2022)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Estimation of vocal tract shapes from speech sounds with a physiological articulatory model., and . J. Phonetics, 30 (3): 511-532 (2002)Speech Emotion Recognition Considering Local Dynamic Features., , , , and . CoRR, (2018)Monolingual Recognizers Fusion for Code-switching Speech Recognition., , , , , , , and . CoRR, (2022)Robust Environmental Sound Recognition with Sparse Key-point Encoding and Efficient Multi-spike Learning., , , , , and . CoRR, (2019)Investigation of the relation between acoustic features and articulation - An application to emotional speech analysis., , and . ISCSLP, page 326-329. IEEE, (2010)Deeper Multiscale Encoding-Decoding Feature Fusion Network for Change Detection of VHR Images., , , , and . IEEE Geosci. Remote. Sens. Lett., (2023)Improving low-resource Tibetan end-to-end ASR by multilingual and multilevel unit modeling., , , , and . EURASIP J. Audio Speech Music. Process., 2022 (1): 2 (2022)基于冗余小波变换与引导滤波的多聚焦图像融合 (Multi-focus Image Fusion Based on Redundant Wavelet Transform and Guided Filtering)., , , and . 计算机科学, 45 (2): 301-305 (2018)Vowel Production Manifold: Intrinsic Factor Analysis of Vowel Articulation., and . IEEE Trans. Speech Audio Process., 18 (5): 1053-1062 (2010)Constructing Accurate and Efficient Deep Spiking Neural Networks With Double-Threshold and Augmented Schemes., , , , , and . IEEE Trans. Neural Networks Learn. Syst., 33 (4): 1714-1726 (2022)