ViT Image Classification Transformer Models

Tools and implementations for training Vision Transformers on image classification tasks across various datasets (MNIST, CIFAR-10, custom domains). Includes from-scratch implementations, fine-tuning tutorials, and comparative studies. Does NOT include vision-language models, object detection, medical imaging, 3D vision, or other downstream vision tasks beyond classification.

There are 35 vit image classification models tracked. 2 score above 50 (established tier). The highest-rated is jaehyunnn/ViTPose_pytorch at 53/100 with 125 stars.

Get all 35 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=transformers&subcategory=vit-image-classification&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

# Model Score Tier
1 jaehyunnn/ViTPose_pytorch

An unofficial implementation of ViTPose [Y. Xu et al., 2022]

53
Established
2 UdbhavPrasad072300/Transformer-Implementations

Library - Vanilla, ViT, DeiT, BERT, GPT

52
Established
3 tintn/vision-transformer-from-scratch

A Simplified PyTorch Implementation of Vision Transformer (ViT)

46
Emerging
4 icon-lab/ResViT

Official Implementation of ResViT: Residual Vision Transformers for...

46
Emerging
5 gupta-abhay/pytorch-vit

An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale

43
Emerging
6 NVlabs/GroupViT

Official PyTorch implementation of GroupViT: Semantic Segmentation Emerges...

42
Emerging
7 rishikksh20/CrossViT-pytorch

Implementation of CrossViT: Cross-Attention Multi-Scale Vision Transformer...

40
Emerging
8 sayakpaul/probing-vits

Probing the representations of Vision Transformers.

38
Emerging
9 all-things-vits/code-samples

Holds code for our CVPR'23 tutorial: All Things ViTs: Understanding and...

37
Emerging
10 kyegomez/MC-ViT

Implementation of the model: "(MC-ViT)" from the paper: "Memory...

37
Emerging
11 jordandeklerk/SwinViT

Modified Swin Transformer model in PyTorch on CIFAR-10 for image classification

36
Emerging
12 Orion-AI-Lab/televit

Teleconnection-driven vision transformers for improved long-term forecasting

31
Emerging
13 sayannath/ViT-TF-Hub-Application

Build and fine-tune your Image Classifier using a Vision Transformer Model...

28
Experimental
14 shub-garg/Vision-Transformer-VIT-for-MNIST

This repository implements a Vision Transformer (ViT) to classify...

27
Experimental
15 vishvaRam/Fine-Tuning-Siglip2-Vit-Model

This repository offers tools and guidance for fine-tuning the Siglip2 Vision...

26
Experimental
16 godofpdog/ViT_PyTorch

This is a simple PyTorch implementation of Vision Transformer (ViT)...

25
Experimental
17 guglielmocamporese/visual-transformer-pytorch

An easy and minimal implementation of the Visual Transformer (ViT) in...

23
Experimental
18 benisalla/Tiny-ViT-Transformer-from-scratch

This repository offers a straightforward implementation of Vision...

23
Experimental
19 wambugu71/SmartAgriImage_classification_ViT

Vision Transformer trained with thousands of agricultural diseases in...

23
Experimental
20 PRITHIVSAKTHIUR/Vit-Mature-Content-Detection

Vit-Mature-Content-Detection is an image classification vision-language...

21
Experimental
21 sergio-sanz-rodriguez/Vision-Transformers-Image-Classification

Development of Vision Transformer (ViT) networks for multi-class image...

20
Experimental
22 bikhanal/vision-transformer

Implementation of Vision Transformer (ViT) from scratch for image classification.

20
Experimental
23 Vitgracer/ViT-from-scratch

Simple minimal Vision Transformer implementation in PyTorch

19
Experimental
24 Sid7on1/ViT-Vision-Transformer

ViT-ClassiPy is a lightweight Vision Transformer built from scratch using...

18
Experimental
25 jordandeklerk/ViT

Implementing a vision transformer model in PyTorch on CIFAR-10

18
Experimental
26 jhtobigs/ViT_Survey

Vision Transformer Survey and Implementation

18
Experimental
27 conceptofmind/DeepViT-flax

Implementation of Deep Vision Transformer in Flax

17
Experimental
28 r-dug/GCViT_Classifier

Image classifier and training script, using GCViT

17
Experimental
29 AddictivelyRecursive/lightweight-multimodal-transformer-pipeline

Lightweight multimodal transformer pipeline comparing MobileViT and...

14
Experimental
30 zubairmk83/ViTP

🌟 Pretrain domain-specific models using visual instructions to enhance...

13
Experimental
31 codebywiam/visual-transformer

A deep learning project using Vision Transformer (ViT) to classify bean leaf...

13
Experimental
32 Nahom32/ViT

An implementation of the vision transformer using CIFAR-10.

13
Experimental
33 KimiaaK/vision-transformer-HuggingFace

This project utilizes ViT via HuggingFace to classify 9 strawberry diseases.

12
Experimental
34 YShokrollahi/vit-transformers-tf

This package provides an implementation of the Vision Transformer (ViT) in...

11
Experimental
35 jordanlei/vision-transformer

Image classification and generation for CIFAR10 using ViT

10
Experimental