mlpc-ucsd/TokenCompose
(CVPR 2024) 🧩 TokenCompose: Text-to-Image Diffusion with Token-level Supervision
This project helps graphic designers and digital artists create highly realistic images from text prompts with improved accuracy for multiple distinct objects. You provide a text description, and the system generates a corresponding image, better reflecting all elements in your prompt. Anyone creating visual content from text descriptions will find this useful.
136 stars. No commits in the last 6 months.
Use this if you need to generate images from complex text prompts that involve multiple distinct objects and require high photorealism.
Not ideal if your primary concern is generating images at extremely high speeds, as enhanced accuracy might introduce a slight increase in generation time.
Stars
136
Forks
5
Language
Jupyter Notebook
License
Apache-2.0
Category
Last pushed
Dec 21, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/diffusion/mlpc-ucsd/TokenCompose"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
PRIS-CV/DemoFusion
Let us democratise high-resolution generation! (CVPR 2024)
mit-han-lab/distrifuser
[CVPR 2024 Highlight] DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models
Tencent-Hunyuan/HunyuanPortrait
[CVPR-2025] The official code of HunyuanPortrait: Implicit Condition Control for Enhanced...
giuvecchio/matfuse
MatFuse: Controllable Material Generation with Diffusion Models (CVPR2024)
Shilin-LU/TF-ICON
[ICCV 2023] "TF-ICON: Diffusion-Based Training-Free Cross-Domain Image Composition" (Official...