Vision Transformer - Pytorch

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch.

Main metrics

Overview

Name With Owner: lucidrains/vit-pytorch
Primary Language: Python
Program language: Python (Language Count: 1)
Platform: Linux, Mac, Windows
License: MIT License
Release Count: 238
Last Release Name: 1.14.5
First Release Name: 0.0.1
Created At: 2020-10-03 22:47:24
Pushed At: 2025-10-24 21:00:44
Last Commit At
Stargazers Count: 24259
Watchers Count: 158
Fork Count: 3417
Commits Count: 359
Has Issues Enabled
Issues Count: 274
Issue Open Count: 130
Pull Requests Count: 38
Pull Requests Open Count: 12
Pull Requests Close Count: 11
Has Wiki Enabled
Is Archived
Is Fork
Is Locked
Is Mirror
Is Private