aofrancani/DPT-VO
Dense Prediction Transformer for scale estimation in monocular visual odometry
When navigating with a single camera, such as on an autonomous vehicle or a medical robot, it's hard to precisely know your exact distance or scale from objects. This project takes a sequence of single-camera images and outputs improved depth and position estimates, making your robot's or device's movements more accurate. This is for robotics engineers or developers working on monocular visual odometry systems.
No commits in the last 6 months.
Use this if you need to improve the accuracy of scale and depth estimation in monocular visual odometry systems for applications like autonomous navigation or augmented reality.
Not ideal if you are working with stereo cameras, LiDAR, or other depth-sensing technologies, as this project specifically addresses single-camera scale ambiguity.
Stars
38
Forks
5
Language
Python
License
MIT
Category
Last pushed
Dec 07, 2022
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/computer-vision/aofrancani/DPT-VO"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
isl-org/Open3D
Open3D: A Modern Library for 3D Data Processing
cvg/Hierarchical-Localization
Visual localization made easy with hloc
gmberton/CosPlace
Official code for CVPR 2022 paper "Rethinking Visual Geo-localization for Large-Scale Applications"
Vincentqyw/image-matching-webui
🤗 image matching webui
cvg/glue-factory
Training library for local feature detection and matching