BobMcDear/vit-pytorch

PyTorch implementation of the vision transformer

/ 100

Experimental

This project provides a PyTorch implementation of the Vision Transformer (ViT) architecture for machine learning engineers and researchers. The model takes image data as input and outputs class predictions based on the visual content. It is aimed at deep learning practitioners working on computer vision tasks.
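As a rough illustration of the image-to-classification flow described above, the sketch below computes how a ViT turns an image into a token sequence: the image is cut into non-overlapping square patches, each patch is flattened into a vector, and one class token is prepended for the classification head to read. The helper name `vit_token_shapes` and its defaults (224-pixel image, 16-pixel patches, 3 channels) are assumptions chosen for illustration from the commonly used ViT-Base configuration; they are not taken from this repository.

```python
def vit_token_shapes(image_size=224, patch_size=16, channels=3):
    """Return (num_patches, sequence_length, patch_dim) for a square image.

    Hypothetical helper for illustration only; not part of BobMcDear/vit-pytorch.
    """
    if image_size % patch_size != 0:
        raise ValueError("image_size must be divisible by patch_size")

    patches_per_side = image_size // patch_size
    num_patches = patches_per_side ** 2       # non-overlapping square patches
    sequence_length = num_patches + 1         # plus one class token
    patch_dim = channels * patch_size ** 2    # length of each flattened patch
    return num_patches, sequence_length, patch_dim


print(vit_token_shapes())  # (196, 197, 768)
```

Under these assumed defaults, the transformer encoder would see a sequence of 197 tokens, each derived from a 768-value patch vector before the learned linear projection.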

No commits in the last 6 months.

Use this if you are a machine learning engineer or researcher building custom computer vision models and need a flexible, modular PyTorch implementation of the Vision Transformer.

Not ideal if you are looking for a pre-trained model or a high-level API for immediate image classification without diving into model architecture details.

computer-vision deep-learning image-classification neural-networks model-building

Stale 6m, No Package, No Dependents

Maintenance 0 / 25

Adoption 6 / 25

Maturity 16 / 25

Community 0 / 25


Stars

Forks

—

Language: Python

License: GPL-3.0

Higher-rated alternatives

Jittor/jittor

Jittor is a high-performance deep learning framework based on JIT compiling and meta-operators.

berniwal/swin-transformer-pytorch

Implementation of the Swin Transformer in PyTorch.

zhanghang1989/ResNeSt

ResNeSt: Split-Attention Networks

NVlabs/FasterViT

[ICLR 2024] Official PyTorch implementation of FasterViT: Fast Vision Transformers with...

ViTAE-Transformer/ViTPose

The official repo for [NeurIPS'22] "ViTPose: Simple Vision Transformer Baselines for Human Pose...
