zhouchenlin2096/Awesome-Transformer-for-Vision-Recognition
A comprehensive paper list of Transformer & Attention for Vision Recognition / Foundation Model, including papers, codes, and related websites.
This repository is a curated list of research papers, code, and related websites on Transformer and attention models for computer vision tasks and foundation models. It covers image recognition, object detection, and other vision tasks. Entries point to the core academic work, often with direct links to implementation code, which can serve as a starting point for building or improving vision systems.
No commits in the last 6 months.
Use this if you are a computer vision researcher or engineer looking for academic papers and code implementations for Transformer and attention models in vision recognition. The list was last pushed on Jul 24, 2023, so it is unlikely to cover work published after that date.
Not ideal if you are looking for a beginner's guide to computer vision or pre-built, production-ready vision models for immediate deployment without deep technical understanding.
Stars: 20
Forks: 2
Language: not listed
License: not listed
Category: not shown
Last pushed: Jul 24, 2023
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/zhouchenlin2096/Awesome-Transformer-for-Vision-Recognition"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
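The same request can be made from a script. The following is a minimal Python sketch using only the standard library. The helper names `build_quality_url` and `fetch_quality` are hypothetical, and the sketch assumes the endpoint returns a JSON body, which the page does not state. The URL layout (`ml-frameworks`, then owner, then repository) is copied from the curl command above.

```python
import json
import urllib.parse
import urllib.request

# Base path taken from the curl command on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Build the request URL from its three path segments (hypothetical helper)."""
    parts = [urllib.parse.quote(p, safe="") for p in (category, owner, repo)]
    return f"{BASE_URL}/{'/'.join(parts)}"


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the record for one repository.

    Assumption: the response body is JSON. The page does not document
    the response format, so inspect the result before relying on any field.
    """
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


# Example call (performs a network request, so it is left commented out):
# data = fetch_quality(
#     "ml-frameworks",
#     "zhouchenlin2096",
#     "Awesome-Transformer-for-Vision-Recognition",
# )
# print(data)
```

Without a key, calls count against the 100 requests/day limit, so caching the result locally is a reasonable precaution when polling several repositories.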
Higher-rated alternatives
Jittor/jittor
Jittor is a high-performance deep learning framework based on JIT compiling and meta-operators.
zhanghang1989/ResNeSt
ResNeSt: Split-Attention Networks
berniwal/swin-transformer-pytorch
Implementation of the Swin Transformer in PyTorch.
NVlabs/FasterViT
[ICLR 2024] Official PyTorch implementation of FasterViT: Fast Vision Transformers with...
ViTAE-Transformer/ViTPose
The official repo for [NeurIPS'22] "ViTPose: Simple Vision Transformer Baselines for Human Pose...