apple/ml-mdm

Train high-quality text-to-image diffusion models in a data & compute efficient manner

/ 100

Emerging

This tool helps creative professionals and researchers efficiently generate high-quality images and videos from text descriptions. You input text prompts (like "a cat in a space suit") and it outputs detailed images, up to 1024x1024 pixels. It's designed for users who need to create custom visual content without extensive manual design work.

515 stars. No commits in the last 6 months. Available on PyPI.

Use this if you need to generate realistic, high-resolution images or videos from text descriptions, especially when working with limited training data and compute resources.

Not ideal if you're looking for a simple, off-the-shelf image generator without any setup or customization, or if your primary need is for image editing rather than generation.

generative-AI digital-art content-creation visual-design AI-research

Maintenance 0 / 25

Adoption 10 / 25

Maturity 25 / 25

Community 14 / 25

Stars: 515

Language: Python

License: MIT

Higher-rated alternatives

huggingface/diffusers

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

bghira/SimpleTuner

A general fine-tuning kit geared toward image/video/audio diffusion models.

mcmonkeyprojects/SwarmUI

SwarmUI (formerly StableSwarmUI), A Modular Stable Diffusion Web-User-Interface, with an...

nateraw/stable-diffusion-videos

Create 🔥 videos with Stable Diffusion by exploring the latent space and morphing between text prompts

TheDesignFounder/DreamLayer

Benchmark diffusion models faster. Automate evals, seeds, and metrics for reproducible results.
