Qrange-group/SUR-adapter

ACM MM'23 (oral), SUR-adapter for pre-trained diffusion models can acquire the powerful semantic understanding and reasoning capabilities from large language models to build a high-quality textual semantic representation for text-to-image generation.

/ 100

Emerging

This project helps graphic designers, digital artists, and marketers create higher-quality, more semantically accurate images from text descriptions. It takes a plain language text prompt and, by leveraging advanced language understanding, produces a refined visual output that better matches the user's intent. Digital creators who use text-to-image generation tools will find this project useful.

120 stars. No commits in the last 6 months.

Use this if you need to improve the semantic understanding and visual quality of images generated from text prompts using pre-trained diffusion models.

Not ideal if you are looking for a completely new image generation model rather than an enhancement for existing diffusion models.

generative-art digital-illustration content-creation text-to-image AI-art

Stale 6m No Package No Dependents

Maintenance 2 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 4 / 25

How are scores calculated?

Stars

120

Forks

Language

Python

License

MIT

Higher-rated alternatives

ljleb/sd-mecha

Executable State Dict Recipes

SJTU-DENG-Lab/Discrete-Diffusion-Forcing

Discrete Diffusion Forcing (D2F): dLLMs Can Do Faster-Than-AR Inference

declare-lab/tango

A family of diffusion models for text-to-audio generation.

Li-Jinsong/DAEDAL

[ICLR 2026] Official repository of "Beyond Fixed: Training-Free Variable-Length Denoising for...

SalesforceAIResearch/CoDA

Salesforce AI Research's open diffusion language model

Explore Diffusion Models

All categories Trending Diffusion directory Insights