Author of the publication

An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale

(2020). arXiv:2010.11929. Comment: Fine-tuning code and pre-trained models are available at https://github.com/google-research/vision_transformer. ICLR camera-ready version with 2 small modifications: 1) Added a discussion of CLS vs GAP classifier in the appendix, 2) Fixed an error in exaFLOPs computation in Figure 5 and Table 6 (relative performance of models is basically not affected).


Other publications of authors with the same name

Scaling Vision Transformers to 22 Billion Parameters. CoRR, (2023)
Revisiting the Calibration of Modern Neural Networks. NeurIPS, page 15682-15694. (2021)
Simple Open-Vocabulary Object Detection with Vision Transformers. CoRR, (2022)
SCENIC: A JAX Library for Computer Vision Research and Beyond. CoRR, (2021)
Scaling Vision Transformers to 22 Billion Parameters. ICML, volume 202 of Proceedings of Machine Learning Research, page 7480-7512. PMLR, (2023)
FlexiViT: One Model for All Patch Sizes. CVPR, page 14496-14506. IEEE, (2023)
Patch n' Pack: NaViT, a Vision Transformer for any Aspect Ratio and Resolution. CoRR, (2023)
On Robustness and Transferability of Convolutional Neural Networks. CVPR, page 16458-16468. Computer Vision Foundation / IEEE, (2021)
Denoising Pretraining for Semantic Segmentation. CVPR Workshops, page 4174-4185. IEEE, (2022)
Simple Open-Vocabulary Object Detection. ECCV (10), volume 13670 of Lecture Notes in Computer Science, page 728-755. Springer, (2022)