ChristophReich1996/Swin-Transformer-V2
PyTorch reimplementation of the paper "Swin Transformer V2: Scaling Up Capacity and Resolution" (CVPR 2022)
This project offers a powerful tool for image recognition tasks, allowing you to build and fine-tune models that understand and categorize images. You provide a dataset of images, and it outputs a trained model capable of classifying new, unseen images. This is primarily for machine learning engineers or researchers working on advanced computer vision applications.
205 stars. No commits in the last 6 months.
Use this if you are a machine learning practitioner looking to implement or experiment with a state-of-the-art Swin Transformer V2 model for image classification or feature extraction.
Not ideal if you are an end-user without programming experience, as this is a technical library requiring Python and PyTorch knowledge.
Stars
205
Forks
19
Language
Python
License
MIT
Category
Last pushed
Sep 11, 2022
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/ChristophReich1996/Swin-Transformer-V2"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
pairlab/SlotFormer
Code release for ICLR 2023 paper: SlotFormer on object-centric dynamics models
prismformore/Multi-Task-Transformer
Code of ICLR2023 paper "TaskPrompter: Spatial-Channel Multi-Task Prompting for Dense Scene...
DirtyHarryLYL/Transformer-in-Vision
Recent Transformer-based CV and related works.
kyegomez/MegaVIT
The open source implementation of the model from "Scaling Vision Transformers to 22 Billion Parameters"
uakarsh/latr
Implementation of LaTr: Layout-aware transformer for scene-text VQA,a novel multimodal...