JAMESYJL/ShapeLLM-Omni
[NeurIPS 2025 Spotlight] A Native Multimodal LLM for 3D Generation and Understanding
This tool helps 3D artists, designers, and researchers generate and understand 3D content from natural language. Given text descriptions, images, or existing 3D models as input, it outputs new 3D models or detailed analyses of 3D shapes — useful for 3D design, virtual reality, and architectural visualization work.
Use this if you need to quickly create new 3D models from text prompts or gain insights into complex 3D structures without specialized modeling software.
Not ideal if you require fine-grained, precise manual control over every vertex and polygon, as it currently focuses on AI-driven generation and understanding.
Stars
549
Forks
29
Language
Python
License
MIT
Category
Last pushed
Oct 20, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/JAMESYJL/ShapeLLM-Omni"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
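If you prefer calling the API from Python rather than curl, the endpoint in the example above can be wrapped in a small helper. This is a minimal sketch: the `quality_url` function name is hypothetical, and the "transformers" category segment is simply copied from the example URL, not documented separately.

```python
# Minimal sketch of building the quality-API URL shown in the curl example.
# quality_url is a hypothetical helper, not part of any official client.
from urllib.parse import quote

BASE = "https://pt-edge.onrender.com/api/v1/quality"

def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the API URL for a repository's quality data."""
    # quote() escapes any characters that are unsafe in a URL path segment.
    return f"{BASE}/{quote(category)}/{quote(owner)}/{quote(repo)}"

url = quality_url("transformers", "JAMESYJL", "ShapeLLM-Omni")
# -> https://pt-edge.onrender.com/api/v1/quality/transformers/JAMESYJL/ShapeLLM-Omni
```

From there, a standard-library `urllib.request.urlopen(url)` call (optionally with an API-key header, if you have registered for the higher 1,000/day limit) returns the same JSON the curl command prints.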
Higher-rated alternatives
KimMeen/Time-LLM
[ICLR 2024] Official implementation of "🦙 Time-LLM: Time Series Forecasting by Reprogramming...
om-ai-lab/VLM-R1
Solve Visual Understanding with Reinforced VLMs
bytedance/SALMONN
SALMONN family: A suite of advanced multi-modal LLMs
NVlabs/OmniVinci
OmniVinci is an omni-modal LLM for joint understanding of vision, audio, and language.
fixie-ai/ultravox
A fast multimodal LLM for real-time voice