Alpha-VLLM/Lumina-T2X
Lumina-T2X is a unified framework for Text to Any Modality Generation
This project helps creative professionals, marketers, and content creators easily transform written descriptions into various media formats. You provide text prompts, and it generates images, music, or other visual/auditory content. This is for anyone who needs to quickly prototype or produce diverse media from simple text input.
2,254 stars. No commits in the last 6 months.
Use this if you need to generate high-quality images, audio, or other media directly from text descriptions, saving time and resources in content creation.
Not ideal if you require precise, manual control over every detail of the generated output, as the focus is on broad text-to-media conversion.
Stars
2,254
Forks
95
Language
Python
License
MIT
Category
Last pushed
Feb 16, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/Alpha-VLLM/Lumina-T2X"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
filipstrand/mflux
MLX native implementations of state-of-the-art generative image models
potamides/DeTikZify
Synthesizing Graphics Programs for Scientific Figures and Sketches with TikZ.
FoundationVision/Infinity
[CVPR 2025 Oral]Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis
zai-org/CogView
Text-to-Image generation. The repo for NeurIPS 2021 paper "CogView: Mastering Text-to-Image...
EleutherAI/DALLE-mtf
Open-AI's DALL-E for large scale training in mesh-tensorflow.