Description
Implementation of Vision Transformer, a simple way to achieve state-of-the-art (SOTA) results in vision classification with only a single transformer encoder, in PyTorch (lucidrains/vit-pytorch).
Summary
- README.md, "Vision Transformer - Pytorch": an implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in PyTorch.
- The significance of the approach is further explained in Yannic Kilcher's video.
- For a PyTorch implementation with pretrained models, see Ross Wightman's repository.
- Distillation: a recent paper has shown that using a distillation token to distill knowledge from convolutional nets into a vision transformer can yield small and efficient vision transformers.
- A pytorch-lightning script is ready for you to use at the repository link above.
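
To make the "single transformer encoder" idea more concrete, here is a minimal sketch of how many tokens the encoder would see for a given image. It is not code from the repository: the function name `token_sequence_length` and its parameters are hypothetical, and it assumes square images, non-overlapping square patches, one class token, and optionally one extra distillation token as described in the distillation bullet.

```python
def token_sequence_length(image_size: int, patch_size: int,
                          use_distillation_token: bool = False) -> int:
    """Return the number of tokens fed to the transformer encoder.

    Assumptions (illustrative, not taken from the repository):
    square image, non-overlapping square patches, one class token,
    plus one distillation token when requested.
    """
    if image_size <= 0 or patch_size <= 0:
        raise ValueError("image_size and patch_size must be positive")
    if image_size % patch_size != 0:
        raise ValueError("image_size must be divisible by patch_size")

    patches_per_side = image_size // patch_size
    num_patches = patches_per_side ** 2  # one token per image patch

    # The class token is always counted; the distillation token is optional.
    extra_tokens = 1 + (1 if use_distillation_token else 0)
    return num_patches + extra_tokens


if __name__ == "__main__":
    print(token_sequence_length(224, 16))                               # 197
    print(token_sequence_length(224, 16, use_distillation_token=True))  # 198
```

With a 224-pixel image and 16-pixel patches this gives 14 x 14 = 196 patch tokens, so the distillation token adds only one more position to the sequence.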