EthanBnntt/tinygrad-vit

A minimalist implementation of the ViT (Vision Transformer) model, using tinygrad

/ 100

Experimental

This project helps machine learning practitioners or researchers who want to understand or experiment with the core mechanics of Vision Transformers (ViT) for image classification. It takes images as input and processes them to classify what's in the image, demonstrating how a transformer architecture can be applied to visual data without complex convolutional layers. It's designed for those familiar with deep learning concepts but seeking a stripped-down, readable implementation.

No commits in the last 6 months.

Use this if you are an ML practitioner or student interested in a barebones, educational implementation of a Vision Transformer to understand its internal workings for image classification.

Not ideal if you need a production-ready, highly optimized, or feature-rich Vision Transformer for immediate deployment or large-scale tasks.

image-classification deep-learning-research computer-vision model-architecture machine-learning-education

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 6 / 25

Maturity 16 / 25

Community 5 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

MIT

Higher-rated alternatives

Jittor/jittor

Jittor is a high-performance deep learning framework based on JIT compiling and meta-operators.

berniwal/swin-transformer-pytorch

Implementation of the Swin Transformer in PyTorch.

zhanghang1989/ResNeSt

ResNeSt: Split-Attention Networks

NVlabs/FasterViT

[ICLR 2024] Official PyTorch implementation of FasterViT: Fast Vision Transformers with...

ViTAE-Transformer/ViTPose

The official repo for [NeurIPS'22] "ViTPose: Simple Vision Transformer Baselines for Human Pose...

Explore ML Frameworks

All categories Trending ML Framework directory Insights