Raessan/dinov3_ros
ROS 2 integration of Meta’s DINOv3 backbone with lightweight heads for vision tasks.
This tool helps roboticists and automation engineers process camera feeds efficiently for various tasks like identifying objects, understanding scene layouts, estimating distances, and tracking motion. It takes raw image data from a robotic system's cameras and outputs processed information such as bounding boxes for detected objects, segmented areas, depth maps, or optical flow data. This is useful for anyone developing or deploying robots that need to interpret their visual environment quickly for navigation, manipulation, or interaction.
Use this if you need your robot to perform multiple vision tasks like object detection and depth estimation simultaneously, using a single camera input, without excessive computational overhead.
Not ideal if your application requires only a single, highly specialized vision task and you prioritize state-of-the-art accuracy over multi-task efficiency.
Stars
93
Forks
5
Language
Python
License
Apache-2.0
Category
Last pushed
Oct 21, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/computer-vision/Raessan/dinov3_ros"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
isl-org/Open3D
Open3D: A Modern Library for 3D Data Processing
cvg/Hierarchical-Localization
Visual localization made easy with hloc
gmberton/CosPlace
Official code for CVPR 2022 paper "Rethinking Visual Geo-localization for Large-Scale Applications"
Vincentqyw/image-matching-webui
🤗 image matching webui
cvg/glue-factory
Training library for local feature detection and matching