songweige/rich-text-to-image
Rich-Text-to-Image Generation
This tool helps graphic designers, marketers, and content creators generate detailed images from text descriptions. Instead of a plain text prompt, you provide rich text: a description formatted with specific fonts, colors, sizes, and footnotes. The output is an image that reflects those formatting choices, giving you fine-grained control over colors, styles, and emphasis.
801 stars. No commits in the last 6 months.
Use this if you need to generate images where specific elements, colors, or styles in your text description must be accurately reflected in the final image, like a product shot with exact branding colors.
Not ideal if you're looking for a simple text-to-image generator without needing precise control over individual elements' appearance.
Stars: 801
Forks: 68
Language: Python
License: MIT
Category:
Last pushed: Oct 09, 2023
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/diffusion/songweige/rich-text-to-image"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
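The same request can be made from Python. The following is a minimal sketch using only the standard library, not an official client. The helper names `quality_url` and `fetch_quality` are hypothetical, the category/owner/repo path layout is inferred from the curl example above, and the response is assumed (not confirmed) to be JSON, so the code falls back to returning raw text.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example; everything after it is
# assumed to follow the pattern <category>/<owner>/<repo>.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL (hypothetical helper, layout inferred)."""
    parts = [quote(p, safe="") for p in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the listing data without an API key.

    Returns parsed JSON if the body is valid JSON (an assumption about
    the response format), otherwise the raw response text.
    """
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        body = resp.read().decode("utf-8")
    try:
        return json.loads(body)
    except json.JSONDecodeError:
        return body


# Usage (performs a network request, counted against the daily limit):
# data = fetch_quality("diffusion", "songweige", "rich-text-to-image")
```

Keeping URL construction separate from the network call makes it easy to check the request path without spending any of the daily quota.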
Higher-rated alternatives
NVlabs/Sana
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
FoundationVision/VAR
[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈]...
nerdyrodent/VQGAN-CLIP
Just playing with getting VQGAN+CLIP running locally, rather than having to use colab.
huggingface/finetrainers
Scalable and memory-optimized training of diffusion models
AssemblyAI-Community/MinImagen
MinImagen: A minimal implementation of the Imagen text-to-image model