apple/ml-mdm
Train high-quality text-to-image diffusion models in a data & compute efficient manner
This tool helps creative professionals and researchers efficiently generate high-quality images and videos from text descriptions. You input text prompts (like "a cat in a space suit") and it outputs detailed images, up to 1024x1024 pixels. It's designed for users who need to create custom visual content without extensive manual design work.
515 stars. No commits in the last 6 months. Available on PyPI.
Use this if you need to generate realistic, high-resolution images or videos from text descriptions, especially when working with limited training data and compute resources.
Not ideal if you're looking for a simple, off-the-shelf image generator without any setup or customization, or if your primary need is for image editing rather than generation.
Stars
515
Forks
36
Language
Python
License
MIT
Category
Last pushed
Mar 27, 2025
Commits (30d)
0
Dependencies
1
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/diffusion/apple/ml-mdm"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
huggingface/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
bghira/SimpleTuner
A general fine-tuning kit geared toward image/video/audio diffusion models.
mcmonkeyprojects/SwarmUI
SwarmUI (formerly StableSwarmUI), A Modular Stable Diffusion Web-User-Interface, with an...
nateraw/stable-diffusion-videos
Create 🔥 videos with Stable Diffusion by exploring the latent space and morphing between text prompts
TheDesignFounder/DreamLayer
Benchmark diffusion models faster. Automate evals, seeds, and metrics for reproducible results.