@dblp

Leveraging Acoustic Contextual Representation by Audio-textual Cross-modal Learning for Conversational ASR.

, , , , and . INTERSPEECH, page 1016-1020. ISCA, (2022)

Links and resources

Tags