LeapLabTHU/DAT-Segmentation
Repository of Vision Transformer with Deformable Attention (CVPR2022) and DAT++: Spatially Dynamic Vision Transformerwith Deformable Attention
This project provides tools for semantic segmentation, a computer vision task where you assign a label to every pixel in an image. You input images, and it outputs images where different objects or regions are precisely outlined and categorized. It's designed for computer vision researchers and engineers who work on advanced image analysis.
No commits in the last 6 months.
Use this if you need to perform highly accurate pixel-level classification of objects or regions within images, particularly for research or development of advanced computer vision models.
Not ideal if you're looking for an out-of-the-box application for general image editing or simple object recognition.
Stars
26
Forks
3
Language
Python
License
—
Category
Last pushed
Sep 07, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/computer-vision/LeapLabTHU/DAT-Segmentation"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
BR-IDL/PaddleViT
:robot: PaddleViT: State-of-the-art Visual Transformer and MLP Models for PaddlePaddle 2.0+
pathak22/unsupervised-video
[CVPR 2017] Unsupervised deep learning using unlabelled videos on the web
IBM/CrossViT
Official implementation of CrossViT. https://arxiv.org/abs/2103.14899
NVlabs/GCVit
[ICML 2023] Official PyTorch implementation of Global Context Vision Transformers
ViTAE-Transformer/ViTDet
Unofficial implementation for [ECCV'22] "Exploring Plain Vision Transformer Backbones for Object...