nerminnuraydogan/vision-transformer

Vision Transformer explanation and implementation with PyTorch

/ 100

Emerging

This project helps machine learning practitioners understand how Vision Transformers classify images. It takes an image as input and processes it by splitting it into patches, embedding them, and passing them through a Transformer Encoder. The output is the predicted class of the image, making it useful for those who build or study image recognition systems.

No commits in the last 6 months.

Use this if you are a machine learning engineer or researcher who wants to learn the inner workings and implement a Vision Transformer model for image classification.

Not ideal if you are looking for a plug-and-play image classification tool or a general-purpose computer vision library without needing to understand the model architecture.

image-classification deep-learning-education computer-vision transformer-models

No License Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 8 / 25

Maturity 8 / 25

Community 14 / 25

How are scores calculated?

Stars

Forks

Language

Jupyter Notebook

License

—

Higher-rated alternatives

Jittor/jittor

Jittor is a high-performance deep learning framework based on JIT compiling and meta-operators.

berniwal/swin-transformer-pytorch

Implementation of the Swin Transformer in PyTorch.

zhanghang1989/ResNeSt

ResNeSt: Split-Attention Networks

NVlabs/FasterViT

[ICLR 2024] Official PyTorch implementation of FasterViT: Fast Vision Transformers with...

ViTAE-Transformer/ViTPose

The official repo for [NeurIPS'22] "ViTPose: Simple Vision Transformer Baselines for Human Pose...

Explore ML Frameworks

All categories Trending ML Framework directory Insights