Inproceedings,

Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet.

, , , , , , , , and .
ICCV, page 538-547. IEEE, (2021)

Meta data

Tags

Users

  • @tobias.koopmann
  • @dblp

Comments and Reviews