heng-hw/SpaCap3D

[IJCAI 2022] Spatiality-guided Transformer for 3D Dense Captioning on Point Clouds (official pytorch implementation)

29
/ 100
Experimental

This project helps researchers and developers in computer vision automatically generate natural language descriptions for specific objects within 3D scans. You input 3D point cloud data, potentially with additional RGB or normal information, and it outputs concise, accurate captions describing individual objects identified in the scan. This is ideal for those working on tasks like robotic scene understanding or creating accessible descriptions of 3D environments.

No commits in the last 6 months.

Use this if you need to automatically generate detailed textual descriptions for objects found within complex 3D scans or point clouds.

Not ideal if your primary goal is general scene captioning without object-specific focus, or if you only have 2D image data.

3D-scene-understanding robotics-perception augmented-reality point-cloud-analysis computer-vision-research
No License Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 6 / 25
Maturity 8 / 25
Community 15 / 25

How are scores calculated?

Stars

21

Forks

5

Language

Python

License

Category

image-captioning

Last pushed

Aug 31, 2022

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/heng-hw/SpaCap3D"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.