lucidrains/vit-pytorch

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch

/ 100

Verified

This project offers an accessible implementation of Vision Transformers (ViT) in PyTorch, enabling practitioners to classify images with high accuracy. It takes raw image data as input and outputs classifications, indicating what objects or features are present in the image. This is for machine learning engineers and researchers looking to apply advanced vision models to their image classification tasks.

24,988 stars. Used by 2 other packages. Actively maintained with 4 commits in the last 30 days. Available on PyPI.

Use this if you are developing computer vision systems and need a flexible, state-of-the-art approach to image classification.

Not ideal if you are a beginner with no experience in Python or deep learning frameworks, as it requires coding knowledge to implement.

image-classification computer-vision deep-learning machine-learning-research

Maintenance 16 / 25

Adoption 12 / 25

Maturity 25 / 25

Community 22 / 25

How are scores calculated?

Stars

24,988

Forks

3,479

Language

Python

License

MIT

Related tools

roflcoopter/viseron

Self-hosted, local only NVR and AI Computer Vision software. With features such as object...

blakeblackshear/frigate

NVR with realtime local object detection for IP cameras

levan92/deep_sort_realtime

A really more real-time adaptation of deep sort

notAI-tech/NudeNet

Lightweight nudity detection

blakeblackshear/frigate-hass-integration

Frigate integration for Home Assistant

Explore Computer Vision Tools

All categories Trending Computer Vision directory Insights