From post

Recycle-and-Distill: Universal Compression Strategy for Transformer-based Speech SSL Models with Attention Map Reusing and Masking Distillation.

, , , и . INTERSPEECH, стр. 316-320. ISCA, (2023)