ch3cook-fdu/Vote2Cap-DETR
[T-PAMI 2024] & [CVPR 2023] Vote2Cap-DETR; A set-to-set perspective towards 3D Dense Captioning; State-of-the-Art 3D Dense Captioning methods
This project helps computer vision researchers and engineers describe 3D scenes automatically. It takes 3D point cloud data of an indoor environment as input and generates a detailed textual caption for each individual object in the scene, giving an automated, object-level description of complex 3D environments.
104 stars. No commits in the last 6 months.
Use this if you are working with 3D point cloud data and need to automatically identify objects and generate descriptive sentences about them, similar to how a person would describe a room.
Not ideal if your primary goal is general object detection or classification without the need for dense, natural language descriptions of specific objects in 3D space.
Stars: 104
Forks: 10
Language: Python
License: MIT
Category:
Last pushed: Aug 17, 2024
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/computer-vision/ch3cook-fdu/Vote2Cap-DETR"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
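The same request can be sketched in Python using only the standard library. The path layout (category, owner, repository) is read off the curl URL above. The helper names `build_quality_url` and `fetch_quality` are hypothetical, and the assumption that the endpoint returns JSON is not confirmed by this page.

```python
import json
from urllib.parse import quote
from urllib.request import urlopen

# Base path taken from the curl example on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Build the request URL (hypothetical helper; layout inferred from the curl example)."""
    # Percent-encode each path segment so unusual characters cannot break the URL.
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the record for one repository.

    Assumption: the response body is JSON. This page does not document the format.
    """
    url = build_quality_url(category, owner, repo)
    with urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_quality("computer-vision", "ch3cook-fdu", "Vote2Cap-DETR")` issues the same GET request as the curl command. Unauthenticated use is limited to 100 requests/day, so cache results when polling many repositories.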
Higher-rated alternatives
isl-org/Open3D
Open3D: A Modern Library for 3D Data Processing
cvg/Hierarchical-Localization
Visual localization made easy with hloc
gmberton/CosPlace
Official code for CVPR 2022 paper "Rethinking Visual Geo-localization for Large-Scale Applications"
Vincentqyw/image-matching-webui
🤗 image matching webui
cvg/glue-factory
Training library for local feature detection and matching