LeapLabTHU/DAT-Segmentation

Repository of Vision Transformer with Deformable Attention (CVPR2022) and DAT++: Spatially Dynamic Vision Transformerwith Deformable Attention

/ 100

Experimental

This project provides tools for semantic segmentation, a computer vision task where you assign a label to every pixel in an image. You input images, and it outputs images where different objects or regions are precisely outlined and categorized. It's designed for computer vision researchers and engineers who work on advanced image analysis.

No commits in the last 6 months.

Use this if you need to perform highly accurate pixel-level classification of objects or regions within images, particularly for research or development of advanced computer vision models.

Not ideal if you're looking for an out-of-the-box application for general image editing or simple object recognition.

computer-vision-research image-segmentation pixel-level-analysis deep-learning-models AI-development

No License Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 7 / 25

Maturity 8 / 25

Community 10 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

—

Higher-rated alternatives

BR-IDL/PaddleViT

:robot: PaddleViT: State-of-the-art Visual Transformer and MLP Models for PaddlePaddle 2.0+

pathak22/unsupervised-video

[CVPR 2017] Unsupervised deep learning using unlabelled videos on the web

IBM/CrossViT

Official implementation of CrossViT. https://arxiv.org/abs/2103.14899

NVlabs/GCVit

[ICML 2023] Official PyTorch implementation of Global Context Vision Transformers

ViTAE-Transformer/ViTDet

Unofficial implementation for [ECCV'22] "Exploring Plain Vision Transformer Backbones for Object...

Explore Computer Vision Tools

All categories Trending Computer Vision directory Insights