yangcaoai/CoDA_NeurIPS2023
Official code for NeurIPS2023 paper: CoDA: Collaborative Novel Box Discovery and Cross-modal Alignment for Open-vocabulary 3D Object Detection
This project helps engineers, roboticists, and researchers automatically identify and locate a wide variety of objects in 3D scans of indoor environments, even if the system hasn't been explicitly trained on those specific objects. It takes 3D point cloud data and descriptive text as input, then outputs precise 3D bounding boxes and labels for all detected objects. This is ideal for professionals developing smart environments, autonomous robots, or advanced virtual reality applications.
221 stars. No commits in the last 6 months.
Use this if you need to detect and localize many different types of objects in 3D indoor scenes using point cloud data, especially when you encounter objects that were not part of your initial training data.
Not ideal if your application requires object detection in 2D images, outdoor environments, or exclusively with a fixed, predefined set of object categories.
Stars
221
Forks
16
Language
Jupyter Notebook
License
MIT
Category
Last pushed
Sep 10, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/computer-vision/yangcaoai/CoDA_NeurIPS2023"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
isl-org/Open3D
Open3D: A Modern Library for 3D Data Processing
cvg/Hierarchical-Localization
Visual localization made easy with hloc
gmberton/CosPlace
Official code for CVPR 2022 paper "Rethinking Visual Geo-localization for Large-Scale Applications"
Vincentqyw/image-matching-webui
🤗 image matching webui
cvg/glue-factory
Training library for local feature detection and matching