maclong01/DeBiFormer
[ACCV 2024 ] Official code for "DeBiFormer: Vision Transformer with Deformable Agent Bi-level Routing Attention"
DeBiFormer helps improve the accuracy and efficiency of artificial intelligence models that "see" and interpret images. It takes raw image data and processes it to identify objects, classify images into categories, or segment different parts of an image. Scientists, researchers, and developers working on advanced computer vision applications would use this for building more capable AI systems.
No commits in the last 6 months.
Use this if you are developing computer vision models for tasks like object detection, image classification, or semantic segmentation and need a highly performant and efficient backbone architecture.
Not ideal if you are looking for a plug-and-play solution for general image editing or basic photo organization without prior AI/ML model development experience.
Stars
32
Forks
2
Language
Python
License
Apache-2.0
Category
Last pushed
Jan 08, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/computer-vision/maclong01/DeBiFormer"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
BR-IDL/PaddleViT
:robot: PaddleViT: State-of-the-art Visual Transformer and MLP Models for PaddlePaddle 2.0+
pathak22/unsupervised-video
[CVPR 2017] Unsupervised deep learning using unlabelled videos on the web
IBM/CrossViT
Official implementation of CrossViT. https://arxiv.org/abs/2103.14899
NVlabs/GCVit
[ICML 2023] Official PyTorch implementation of Global Context Vision Transformers
ViTAE-Transformer/ViTDet
Unofficial implementation for [ECCV'22] "Exploring Plain Vision Transformer Backbones for Object...