Author of the publication

Classifier Architectures for Acoustic Scenes and Events: Implications for DNNs, TDNNs, and Perceptual Features from DCASE 2016.

, , , , and . IEEE ACM Trans. Audio Speech Lang. Process., 25 (6): 1304-1314 (2017)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Unsupervised Speaker Adaptation Using Attention-Based Speaker Memory for End-to-End ASR., , , and . ICASSP, page 7384-7388. IEEE, (2020)Sequence Transduction with Graph-Based Supervision., , , and . ICASSP, page 7212-7216. IEEE, (2022)Triggered Attention for End-to-end Speech Recognition., , and . ICASSP, page 5666-5670. IEEE, (2019)SynthVSR: Scaling Up Visual Speech RecognitionWith Synthetic Supervision., , , , , , , , , and 2 other author(s). CVPR, page 18806-18815. IEEE, (2023)Streaming Audio-Visual Speech Recognition with Alignment Regularization., , , , and . INTERSPEECH, page 1598-1602. ISCA, (2023)Unsupervised Domain Adaptation for Speech Recognition via Uncertainty Driven Self-Training., , , and . CoRR, (2020)Extended Graph Temporal Classification for Multi-Speaker End-to-End ASR., , , , and . ICASSP, page 7322-7326. IEEE, (2022)Streaming End-to-End Speech Recognition with Joint CTC-Attention Based Models., , and . ASRU, page 936-943. IEEE, (2019)An Investigation of Monotonic Transducers for Large-Scale Automatic Speech Recognition., , , , and . SLT, page 324-330. IEEE, (2022)Capturing Multi-Resolution Context by Dilated Self-Attention., , and . ICASSP, page 5869-5873. IEEE, (2021)