Vision Transformer Implementations Transformer Models
Reference implementations and educational repositories of Vision Transformer architectures across frameworks (TensorFlow, PyTorch, Keras). Includes core ViT models and variants for standard vision tasks. Does NOT include specialized vision-language models, 3D vision, medical imaging, or hybrid architectures that significantly depart from standard ViT design.
There are 44 vision transformer implementations models tracked. 2 score above 50 (established tier). The highest-rated is Kohulan/DECIMER-Image_Transformer at 55/100 with 345 stars.
Get all 44 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=transformers&subcategory=vision-transformer-implementations&limit=20"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
| # | Model | Score | Tier |
|---|---|---|---|
| 1 |
Kohulan/DECIMER-Image_Transformer
DECIMER Image Transformer is a deep-learning-based tool designed for... |
|
Established |
| 2 |
sovit-123/vision_transformers
Vision Transformers for image classification, image segmentation, and object... |
|
Established |
| 3 |
fcakyon/video-transformers
Easiest way of fine-tuning HuggingFace video classification models |
|
Emerging |
| 4 |
leaderj1001/BottleneckTransformers
Bottleneck Transformers for Visual Recognition |
|
Emerging |
| 5 |
qubvel/transformers-notebooks
Inference and fine-tuning examples for vision models from 🤗 Transformers |
|
Emerging |
| 6 |
rishikksh20/convolution-vision-transformers
PyTorch Implementation of CvT: Introducing Convolutions to Vision Transformers |
|
Emerging |
| 7 |
alohays/awesome-visual-representation-learning-with-transformers
Awesome Transformers (self-attention) in Computer Vision |
|
Emerging |
| 8 |
xmindflow/Awesome-Transformer-in-Medical-Imaging
[MedIA Journal] An ultimately comprehensive paper list of Vision... |
|
Emerging |
| 9 |
mmaaz60/EdgeNeXt
[CADL'22, ECCVW] Official repository of paper titled "EdgeNeXt: Efficiently... |
|
Emerging |
| 10 |
sayakpaul/robustness-vit
Contains code for the paper "Vision Transformers are Robust Learners" (AAAI 2022). |
|
Emerging |
| 11 |
adaptivetokensampling/ATS
Adaptive Token Sampling for Efficient Vision Transformers (ECCV 2022 Oral... |
|
Emerging |
| 12 |
cmhungsteve/Awesome-Transformer-Attention
An ultimately comprehensive paper list of Vision Transformer/Attention,... |
|
Emerging |
| 13 |
RLado/STB-VMM
STB-VMM: Swin Transformer Based Video Motion Magnification (official repository) |
|
Emerging |
| 14 |
ziplab/HVT
[ICCV 2021] Official implementation of "Scalable Vision Transformers with... |
|
Emerging |
| 15 |
GiannakopoulosIlias/vision-transformer-network-for-mr-electrical-properties-tomography
A 3D Vision Transformer-based neural network for reconstructing electrical... |
|
Emerging |
| 16 |
varchasvee108/vision-transformer-maze-agent
Vision Transformer agent that learns to navigate mazes while visualizing... |
|
Emerging |
| 17 |
rajatsaini0294/awesome-image-transformer
List of all the papers on Transformers for Vision. |
|
Experimental |
| 18 |
sayakpaul/vision-transformers-tf
A non-exhaustive collection of vision transformer models implemented in TensorFlow. |
|
Experimental |
| 19 |
EMalagoli92/CvT-TensorFlow
TensorFlow 2.X reimplementation of CvT: Introducing Convolutions to Vision... |
|
Experimental |
| 20 |
MingSun-Tse/Awesome-Efficient-ViT
Recent Advances on Efficient Vision Transformers |
|
Experimental |
| 21 |
danilodjor/image-retrieval-using-transformers
This repository contains code used to perform image retrieval using... |
|
Experimental |
| 22 |
chagmgang/dinov2-remote-sensing
Implementation dino v2 for remote sensing with huggingface transformers |
|
Experimental |
| 23 |
vitality-vis/vitality-vis.github.io
Promoting Serendipitous Discovery of Academic Literature with Transformers &... |
|
Experimental |
| 24 |
uakarsh/TiLT-Implementation
Implementation of the paper: Going Full-TILT Boogie on Document... |
|
Experimental |
| 25 |
revanurambareesh/instantaneous_transformer
Official repo of Instantaneous Transformers for Video based Physiology... |
|
Experimental |
| 26 |
Kotomiya07/kuzushiji-vision
くずし字認識システム |
|
Experimental |
| 27 |
jmanuelc87/vision-transformer
Implementation of different vision transformer models for classification,... |
|
Experimental |
| 28 |
tim-roderick/VST
Video Summarization Transformer: Implementation in PyTorch of the... |
|
Experimental |
| 29 |
matin-ghorbani/Video-Classification-Transformers
Implement a video classification using transformers |
|
Experimental |
| 30 |
ahmedgh970/convnext-charm
Official Tensorflow implementation of ConvNeXt-ChARM: ConvNeXt-based... |
|
Experimental |
| 31 |
koc-lab/vispool
Enhancing Transformer Encoders with Vector Visibility Graph Neural Networks... |
|
Experimental |
| 32 |
mbari-org/vitstrain
Fine-tune vision transformer models to classify Plankton, UAV(drone),... |
|
Experimental |
| 33 |
Justin900429/vision-transformer
Implement the vision transformer using pytorch |
|
Experimental |
| 34 |
nakshatrasinghh/Vision-Transformer
Tensorflow implementation of the Vision Transformer (Bye-Bye Convolutions) |
|
Experimental |
| 35 |
nachiket273/VisTrans
Implementations of transformers based models for different vision tasks |
|
Experimental |
| 36 |
curiousbrutus/fNIRS-Vise
NIRS-VIS is a Master Thesis Project for decoding visual stimuli from fNIRS... |
|
Experimental |
| 37 |
GuillaumeZahnd/vision-transformer
Vision Transformer |
|
Experimental |
| 38 |
4rtux/3D-CNN-Action-Recognition-Model
Identificación de actividades cotidianas basado en visión por computador y... |
|
Experimental |
| 39 |
CodeBy-HP/vit-pose-classifier-aws-fargate-deploy
AI-powered human pose classification API—serverless, scalable, and blazing... |
|
Experimental |
| 40 |
DaniGarciaPerez/vision_transformer
A repo to explore the implementation of a Vision Transformer from scratch... |
|
Experimental |
| 41 |
Ruhaan838/Vision-Transformer
Implementation of Vision Transformer Using PyTorch ... |
|
Experimental |
| 42 |
inuwamobarak/swin-transformers
Swin Transformers, short for "Shifted Windows," were introduced in the paper... |
|
Experimental |
| 43 |
HilaManor/CycleTransformer
Final assignment in the NLP course at the Technion (IEM097215). In this... |
|
Experimental |
| 44 |
Bessouat40/vit-transformers-finetuning
vision-transformers-finetunning |
|
Experimental |