Author of the publication

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

C2KD: Cross-Lingual Cross-Modal Knowledge Distillation for Multilingual Text-Video Retrieval., , , , , , , , , and . CoRR, (2022)What, when, and where? - Self-Supervised Spatio-Temporal Grounding in Untrimmed Multi-Action Videos from Narrated Instructions., , , , , , , , and . CoRR, (2023)Routing with Self-Attention for Multimodal Capsule Networks., , , , , , , , , and . CoRR, (2021)Self-Supervised Segmentation and Source Separation on Videos., , , , and . CVPR Workshops, page 0. Computer Vision Foundation / IEEE, (2019)Self-supervised Audio-visual Co-segmentation., , , , and . ICASSP, page 2357-2361. IEEE, (2019)Label-efficient audio classification through multitask learning and self-supervision., , , , and . CoRR, (2019)Cascaded Multilingual Audio-Visual Learning from Videos., , , , , , , , , and 1 other author(s). Interspeech, page 3006-3010. ISCA, (2021)Contrastive Audio-Visual Masked Autoencoder., , , , , , and . CoRR, (2022)Multimodal Clustering Networks for Self-supervised Learning from Unlabeled Videos., , , , , , , , , and 3 other author(s). ICCV, page 7992-8001. IEEE, (2021)Everything at Once - Multi-modal Fusion Transformer for Video Retrieval., , , , , , , , and . CVPR, page 19988-19997. IEEE, (2022)