nerminnuraydogan/vision-transformer

Vision Transformer explanation and implementation with PyTorch

30 / 100 (Emerging)

This project helps machine learning practitioners understand how Vision Transformers classify images. It takes an image as input and processes it by splitting it into patches, embedding them, and passing them through a Transformer Encoder. The output is the predicted class of the image, making it useful for those who build or study image recognition systems.
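The pipeline described above (split into patches, embed, run through a Transformer Encoder, predict a class) can be sketched in a few lines of PyTorch. This is a minimal illustration under stated assumptions, not the repository's own code: the class name `TinyViT`, the hyperparameters (32x32 images, 8x8 patches, 64-dimensional embeddings, 2 layers, 10 classes), and the use of a strided convolution for patch embedding are all choices made here for brevity.

```python
import torch
import torch.nn as nn


class TinyViT(nn.Module):
    """Hypothetical minimal ViT: patches -> embeddings -> encoder -> class logits."""

    def __init__(self, image_size=32, patch_size=8, in_channels=3,
                 embed_dim=64, depth=2, num_heads=4, num_classes=10):
        super().__init__()
        assert image_size % patch_size == 0, "image size must be divisible by patch size"
        self.num_patches = (image_size // patch_size) ** 2

        # A strided convolution cuts the image into non-overlapping patches
        # and linearly projects each patch to embed_dim in one step.
        self.patch_embed = nn.Conv2d(in_channels, embed_dim,
                                     kernel_size=patch_size, stride=patch_size)

        # Learnable [class] token and one position embedding per token.
        # Zero initialization keeps the sketch short; real models usually
        # use a small random initialization.
        self.cls_token = nn.Parameter(torch.zeros(1, 1, embed_dim))
        self.pos_embed = nn.Parameter(torch.zeros(1, self.num_patches + 1, embed_dim))

        # Pre-norm encoder layers (norm_first=True), as in the original ViT design.
        layer = nn.TransformerEncoderLayer(d_model=embed_dim, nhead=num_heads,
                                           dim_feedforward=embed_dim * 4,
                                           batch_first=True, norm_first=True)
        self.encoder = nn.TransformerEncoder(layer, num_layers=depth)
        self.norm = nn.LayerNorm(embed_dim)
        self.head = nn.Linear(embed_dim, num_classes)

    def forward(self, images):
        # images: (batch, channels, height, width)
        x = self.patch_embed(images)            # (B, D, H/P, W/P)
        x = x.flatten(2).transpose(1, 2)        # (B, num_patches, D)
        cls = self.cls_token.expand(x.shape[0], -1, -1)
        x = torch.cat([cls, x], dim=1) + self.pos_embed
        x = self.encoder(x)                     # (B, num_patches + 1, D)
        return self.head(self.norm(x[:, 0]))    # logits read from the [class] token


model = TinyViT().eval()
with torch.no_grad():
    logits = model(torch.randn(2, 3, 32, 32))   # two random "images"
predicted_class = logits.argmax(dim=-1)          # one class index per image
print(logits.shape, predicted_class.shape)
```

The model here is untrained, so the predicted classes are meaningless; the point is the shape of the data at each stage. A 32x32 image with 8x8 patches yields 16 patch tokens plus one class token, and only the class token's final representation feeds the classification head.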

No commits in the last 6 months.

Use this if you are a machine learning engineer or researcher who wants to learn the inner workings and implement a Vision Transformer model for image classification.

Not ideal if you are looking for a plug-and-play image classification tool or a general-purpose computer vision library without needing to understand the model architecture.

Tags: image-classification, deep-learning-education, computer-vision, transformer-models
Badges: No License, Stale 6m, No Package, No Dependents
Maintenance 0 / 25
Adoption 8 / 25
Maturity 8 / 25
Community 14 / 25


Stars: 67
Forks: 9
Language: Jupyter Notebook
License: None
Last pushed: Nov 11, 2023
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/nerminnuraydogan/vision-transformer"

Open to everyone: 100 requests/day with no key needed. Get a free key for 1,000 requests/day.
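The same request can be made from Python using only the standard library. The sketch below rests on two assumptions that the page does not confirm: that the endpoint returns JSON, and that the URL follows a `category/owner/repo` pattern that generalizes beyond this one repository. The helper names `quality_url` and `fetch_quality` are hypothetical.

```python
import json
import urllib.request

# Base path taken from the curl example; the path layout after it is an assumption.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category, owner, repo):
    """Build the quality endpoint URL for one repository."""
    return f"{BASE_URL}/{category}/{owner}/{repo}"


def fetch_quality(category, owner, repo, timeout=10):
    """Fetch the quality data, assuming the response body is JSON."""
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


# Building the URL needs no network access.
print(quality_url("ml-frameworks", "nerminnuraydogan", "vision-transformer"))

# To perform the actual request (counts against the daily request limit):
# data = fetch_quality("ml-frameworks", "nerminnuraydogan", "vision-transformer")
# print(json.dumps(data, indent=2))
```

The network call is left commented out so the snippet runs offline; the field names in the response are not documented on this page, so the example prints the raw payload rather than reading specific keys.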