maclong01/DeBiFormer

[ACCV 2024 ] Official code for "DeBiFormer: Vision Transformer with Deformable Agent Bi-level Routing Attention"

/ 100

Experimental

DeBiFormer helps improve the accuracy and efficiency of artificial intelligence models that "see" and interpret images. It takes raw image data and processes it to identify objects, classify images into categories, or segment different parts of an image. Scientists, researchers, and developers working on advanced computer vision applications would use this for building more capable AI systems.

No commits in the last 6 months.

Use this if you are developing computer vision models for tasks like object detection, image classification, or semantic segmentation and need a highly performant and efficient backbone architecture.

Not ideal if you are looking for a plug-and-play solution for general image editing or basic photo organization without prior AI/ML model development experience.

computer-vision image-recognition object-detection semantic-segmentation image-classification

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 7 / 25

Maturity 16 / 25

Community 6 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

Apache-2.0

Higher-rated alternatives

BR-IDL/PaddleViT

:robot: PaddleViT: State-of-the-art Visual Transformer and MLP Models for PaddlePaddle 2.0+

pathak22/unsupervised-video

[CVPR 2017] Unsupervised deep learning using unlabelled videos on the web

IBM/CrossViT

Official implementation of CrossViT. https://arxiv.org/abs/2103.14899

NVlabs/GCVit

[ICML 2023] Official PyTorch implementation of Global Context Vision Transformers

ViTAE-Transformer/ViTDet

Unofficial implementation for [ECCV'22] "Exploring Plain Vision Transformer Backbones for Object...

Explore Computer Vision Tools

All categories Trending Computer Vision directory Insights