GuanRunwei/Awesome-Vision-Transformer-Collection
Variants of Vision Transformer and its downstream tasks
This is a curated collection of research papers and associated code for Vision Transformer variants. It helps researchers and engineers quickly find and understand different approaches to processing image and video data with transformer architectures. Use it to survey state-of-the-art visual models and their applications.
257 stars. No commits in the last 6 months.
Use this if you are a researcher or AI engineer looking for a comprehensive overview of Vision Transformer models for tasks like image classification, video analysis, or point cloud processing.
Not ideal if you are looking for an off-the-shelf tool or library to directly apply Vision Transformers without deep technical understanding.
Stars: 257
Forks: 30
Language: not listed
License: not listed
Category:
Last pushed: Jul 16, 2022
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/GuanRunwei/Awesome-Vision-Transformer-Collection"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
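The same request can be made from a script. The following is a minimal sketch using only the Python standard library. The helper names `quality_url` and `fetch_quality` are hypothetical, the `category` parameter name is a guess at what the `ml-frameworks` path segment in the curl URL represents, and the code assumes the endpoint returns JSON, which this page does not state.

```python
import json
import urllib.request

# Base URL copied from the curl example above.
API_BASE = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, repo: str) -> str:
    """Build the endpoint URL for one repository given as 'owner/name'.

    Assumption: the path segment before the repository name (for example
    'ml-frameworks') is a category slug.
    """
    return f"{API_BASE}/{category}/{repo}"


def fetch_quality(category: str, repo: str, timeout: float = 10.0):
    """Fetch the record for one repository.

    Assumption: the response body is JSON. Its schema is not documented
    here, so the parsed object is returned unchanged.
    """
    url = quality_url(category, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


# Example call (performs a network request and counts against the daily limit):
# data = fetch_quality("ml-frameworks",
#                      "GuanRunwei/Awesome-Vision-Transformer-Collection")
# print(json.dumps(data, indent=2))
```

Because unauthenticated access is limited to 100 requests/day, a script that looks up many repositories may want to cache responses locally rather than refetch them.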
Higher-rated alternatives
Jittor/jittor
Jittor is a high-performance deep learning framework based on JIT compiling and meta-operators.
zhanghang1989/ResNeSt
ResNeSt: Split-Attention Networks
berniwal/swin-transformer-pytorch
Implementation of the Swin Transformer in PyTorch.
NVlabs/FasterViT
[ICLR 2024] Official PyTorch implementation of FasterViT: Fast Vision Transformers with...
ViTAE-Transformer/ViTPose
The official repo for [NeurIPS'22] "ViTPose: Simple Vision Transformer Baselines for Human Pose...