LeapLabTHU/DAT

Repository of Vision Transformer with Deformable Attention (CVPR2022) and DAT++: Spatially Dynamic Vision Transformerwith Deformable Attention

/ 100

Emerging

This project provides advanced computer vision models that efficiently classify images. It takes raw image data and outputs accurate classifications of objects within those images, improving upon standard methods by focusing on key visual areas. Researchers and machine learning engineers working on image analysis and recognition tasks would find this particularly useful.

925 stars. No commits in the last 6 months.

Use this if you need to perform image classification with high accuracy and efficiency, especially in scenarios where distinguishing fine details or complex scenes is crucial.

Not ideal if your primary task is object detection or image segmentation, as dedicated versions of this technology exist for those specific applications.

image-classification computer-vision machine-learning-research visual-recognition pattern-recognition

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 18 / 25

How are scores calculated?

Stars

925

Forks

Language

Python

License

Apache-2.0

Higher-rated alternatives

Jittor/jittor

Jittor is a high-performance deep learning framework based on JIT compiling and meta-operators.

berniwal/swin-transformer-pytorch

Implementation of the Swin Transformer in PyTorch.

zhanghang1989/ResNeSt

ResNeSt: Split-Attention Networks

NVlabs/FasterViT

[ICLR 2024] Official PyTorch implementation of FasterViT: Fast Vision Transformers with...

ViTAE-Transformer/ViTPose

The official repo for [NeurIPS'22] "ViTPose: Simple Vision Transformer Baselines for Human Pose...

Explore ML Frameworks

All categories Trending ML Framework directory Insights