Author of the publication

Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles.

, , , , , , , , , , , , and . ICML, volume 202 of Proceedings of Machine Learning Research, page 29441-29454. PMLR, (2023)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Informedia @ TRECVID 2018: Ad-hoc Video Search, Video to Text Description, Activities in Extended video., , , , , , , , , and 9 other author(s). TRECVID, National Institute of Standards and Technology (NIST), (2018)Panoramic depth reconstruction within a single shot by optimizing global sphere radii., , , , and . SIGGRAPH ASIA Posters, page 80:1-80:2. ACM, (2018)Cognitive access in multichannel wireless networks using two-dimension Markov chain., , and . IWCMC, page 169-173. IEEE, (2014)MAViL: Masked Audio-Video Learners., , , , , , , , , and . CoRR, (2022)AV-SUPERB: A Multi-Task Evaluation Benchmark for Audio-Visual Representation Models., , , , , , , , , and 9 other author(s). CoRR, (2023)Improving What Cross-Modal Retrieval Models Learn through Object-Oriented Inter- and Intra-Modal Attention Networks., , , and . ICMR, page 244-252. ACM, (2019)RCAA: Relational Context-Aware Agents for Person Search., , , , , and . ECCV (9), volume 11213 of Lecture Notes in Computer Science, page 86-102. Springer, (2018)Generating Hashtags for Short-form Videos with Guided Signals., , , , , , , , and . ACL (1), page 9482-9495. Association for Computational Linguistics, (2023)Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles., , , , , , , , , and 3 other author(s). ICML, volume 202 of Proceedings of Machine Learning Research, page 29441-29454. PMLR, (2023)Cognitive vertical handover in heterogeneous networks., , and . QSHINE, page 392-397. IEEE, (2015)