mlpc-ucsd/TokenCompose

(CVPR 2024) 🧩 TokenCompose: Text-to-Image Diffusion with Token-level Supervision

Emerging

This project helps graphic designers and digital artists generate photorealistic images from text prompts, with better fidelity when a prompt names multiple distinct objects. You provide a text description, and the system generates an image that more reliably includes every element of the prompt. It is useful to anyone creating visual content from text descriptions.

136 stars. No commits in the last 6 months.

Use this if you need to generate images from complex text prompts that involve multiple distinct objects and require high photorealism.

Not ideal if your primary concern is generation speed, as the enhanced accuracy might slightly increase generation time.

digital-art graphic-design content-creation image-generation visual-content

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 7 / 25

Stars 136

Forks

Language Jupyter Notebook

License Apache-2.0

Higher-rated alternatives

PRIS-CV/DemoFusion

Let us democratise high-resolution generation! (CVPR 2024)

mit-han-lab/distrifuser

[CVPR 2024 Highlight] DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models

Tencent-Hunyuan/HunyuanPortrait

[CVPR-2025] The official code of HunyuanPortrait: Implicit Condition Control for Enhanced...

giuvecchio/matfuse

MatFuse: Controllable Material Generation with Diffusion Models (CVPR2024)

Shilin-LU/TF-ICON

[ICCV 2023] "TF-ICON: Diffusion-Based Training-Free Cross-Domain Image Composition" (Official...
