NVlabs/GCVit
[ICML 2023] Official PyTorch implementation of Global Context Vision Transformers
This project offers an advanced technique for accurately analyzing images, helping systems recognize objects and classify scenes more effectively. It takes raw image data as input and produces highly accurate categorizations and object locations. Data scientists and machine learning engineers who develop computer vision applications will find this beneficial for improving model performance.
447 stars. No commits in the last 6 months.
Use this if you need to build or enhance computer vision models for tasks like image classification, object detection, or semantic segmentation that require state-of-the-art accuracy.
Not ideal if you are looking for a pre-built, ready-to-deploy solution without any coding or machine learning expertise.
Stars
447
Forks
49
Language
Python
License
—
Category
Last pushed
Dec 22, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/computer-vision/NVlabs/GCVit"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
BR-IDL/PaddleViT
:robot: PaddleViT: State-of-the-art Visual Transformer and MLP Models for PaddlePaddle 2.0+
pathak22/unsupervised-video
[CVPR 2017] Unsupervised deep learning using unlabelled videos on the web
IBM/CrossViT
Official implementation of CrossViT. https://arxiv.org/abs/2103.14899
ViTAE-Transformer/ViTDet
Unofficial implementation for [ECCV'22] "Exploring Plain Vision Transformer Backbones for Object...
bytedance/SPTSv2
The official implementation of SPTS v2: Single-Point Text Spotting