NVlabs/MambaVision
[CVPR 2025] Official PyTorch Implementation of MambaVision: A Hybrid Mamba-Transformer Vision Backbone
MambaVision is a tool for machine learning developers and researchers working on computer vision tasks. It processes input images and outputs classifications (e.g., "brown bear") or extracted visual features, which can then be used for more advanced tasks like object detection or semantic segmentation. This is for those building and evaluating cutting-edge computer vision models.
2,060 stars. Available on PyPI.
Use this if you are a machine learning engineer or researcher developing advanced computer vision systems and need a high-performance backbone for image classification, object detection, or semantic segmentation.
Not ideal if you are looking for an out-of-the-box application for general image editing or simple photo organization, as this requires technical expertise in machine learning frameworks.
Stars
2,060
Forks
129
Language
Python
License
—
Category
Last pushed
Mar 11, 2026
Commits (30d)
0
Dependencies
6
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/NVlabs/MambaVision"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related models
sign-language-translator/sign-language-translator
Python library & framework to build custom translators for the hearing-impaired and translate...
kyegomez/Jamba
PyTorch Implementation of Jamba: "Jamba: A Hybrid Transformer-Mamba Language Model"
autonomousvision/transfuser
[PAMI'23] TransFuser: Imitation with Transformer-Based Sensor Fusion for Autonomous Driving;...
kyegomez/MultiModalMamba
A novel implementation of fusing ViT with Mamba into a fast, agile, and high performance...
dali92002/DocEnTR
DocEnTr: An end-to-end document image enhancement transformer - ICPR 2022