zhouchenlin2096/Awesome-Transformer-for-Vision-Recognition

A comprehensive paper list of Transformer & Attention for Vision Recognition / Foundation Model, including papers, codes, and related websites.

/ 100

Experimental

This resource provides a comprehensive list of research papers, code, and related websites focused on Transformer and Attention models for computer vision tasks and foundational models. It helps researchers and engineers stay updated on the latest advancements in image recognition, object detection, and other vision-related AI applications. Users can find the core academic work, often with direct links to implementation code, for building or improving vision-based AI systems.

No commits in the last 6 months.

Use this if you are a computer vision researcher or engineer looking for the most recent academic papers and code implementations for Transformer and Attention models in vision recognition.

Not ideal if you are looking for a beginner's guide to computer vision or pre-built, production-ready vision models for immediate deployment without deep technical understanding.

Computer Vision Research AI Model Development Image Recognition Deep Learning Foundation Models

No License Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 6 / 25

Maturity 8 / 25

Community 8 / 25

How are scores calculated?

Stars

Forks

Language

—

License

—

Higher-rated alternatives

Jittor/jittor

Jittor is a high-performance deep learning framework based on JIT compiling and meta-operators.

zhanghang1989/ResNeSt

ResNeSt: Split-Attention Networks

berniwal/swin-transformer-pytorch

Implementation of the Swin Transformer in PyTorch.

NVlabs/FasterViT

[ICLR 2024] Official PyTorch implementation of FasterViT: Fast Vision Transformers with...

ViTAE-Transformer/ViTPose

The official repo for [NeurIPS'22] "ViTPose: Simple Vision Transformer Baselines for Human Pose...

Explore ML Frameworks

All categories Trending ML Framework directory Insights