NVlabs/GCVit

[ICML 2023] Official PyTorch implementation of Global Context Vision Transformers

/ 100

Emerging

This project offers an advanced technique for accurately analyzing images, helping systems recognize objects and classify scenes more effectively. It takes raw image data as input and produces highly accurate categorizations and object locations. Data scientists and machine learning engineers who develop computer vision applications will find this beneficial for improving model performance.

447 stars. No commits in the last 6 months.

Use this if you need to build or enhance computer vision models for tasks like image classification, object detection, or semantic segmentation that require state-of-the-art accuracy.

Not ideal if you are looking for a pre-built, ready-to-deploy solution without any coding or machine learning expertise.

computer-vision image-recognition object-detection machine-learning-engineering deep-learning

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 17 / 25

How are scores calculated?

Stars

447

Forks

Language

Python

License

—

Higher-rated alternatives

BR-IDL/PaddleViT

:robot: PaddleViT: State-of-the-art Visual Transformer and MLP Models for PaddlePaddle 2.0+

pathak22/unsupervised-video

[CVPR 2017] Unsupervised deep learning using unlabelled videos on the web

IBM/CrossViT

Official implementation of CrossViT. https://arxiv.org/abs/2103.14899

ViTAE-Transformer/ViTDet

Unofficial implementation for [ECCV'22] "Exploring Plain Vision Transformer Backbones for Object...

bytedance/SPTSv2

The official implementation of SPTS v2: Single-Point Text Spotting

Explore Computer Vision Tools

All categories Trending Computer Vision directory Insights