Author of the publication

AudioTagging Done Right: 2nd comparison of deep learning methods for environmental sound classification.

, , , and . INTERSPEECH, page 1521-1525. ISCA, (2022)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Panoramic depth reconstruction within a single shot by optimizing global sphere radii., , , , and . SIGGRAPH ASIA Posters, page 80:1-80:2. ACM, (2018)Cognitive access in multichannel wireless networks using two-dimension Markov chain., , and . IWCMC, page 169-173. IEEE, (2014)Informedia @ TRECVID 2018: Ad-hoc Video Search, Video to Text Description, Activities in Extended video., , , , , , , , , and 9 other author(s). TRECVID, National Institute of Standards and Technology (NIST), (2018)AV-SUPERB: A Multi-Task Evaluation Benchmark for Audio-Visual Representation Models., , , , , , , , , and 9 other author(s). CoRR, (2023)MAViL: Masked Audio-Video Learners., , , , , , , , , and . CoRR, (2022)Improving What Cross-Modal Retrieval Models Learn through Object-Oriented Inter- and Intra-Modal Attention Networks., , , and . ICMR, page 244-252. ACM, (2019)Generating Hashtags for Short-form Videos with Guided Signals., , , , , , , , and . ACL (1), page 9482-9495. Association for Computational Linguistics, (2023)Argus: Efficient Activity Detection System for Extended Video Analysis., , , , , , , , , and 1 other author(s). WACV Workshops, page 126-133. IEEE, (2020)RCAA: Relational Context-Aware Agents for Person Search., , , , , and . ECCV (9), volume 11213 of Lecture Notes in Computer Science, page 86-102. Springer, (2018)Cognitive vertical handover in heterogeneous networks., , and . QSHINE, page 392-397. IEEE, (2015)