huggingface/finetrainers
Scalable and memory-optimized training of diffusion models
This is a specialized library for machine learning engineers and researchers who fine-tune diffusion models. It adapts existing pre-trained diffusion models to generate specific types of images or videos by training them on new datasets. The input is a pre-trained diffusion model and your custom image/video dataset; the output is a refined model that generates content aligned with your specific needs.
1,343 stars. No commits in the last 6 months.
Use this if you are a machine learning engineer or researcher looking to customize diffusion models for specific image or video generation tasks with efficient, memory-optimized training.
Not ideal if you are looking for a no-code solution or a simple API to generate generic images/videos without needing to train custom models.
Stars
1,343
Forks
140
Language
Python
License
Apache-2.0
Category
Last pushed
Jun 04, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/diffusion/huggingface/finetrainers"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
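The same endpoint can be called from Python using only the standard library. This is a minimal sketch: the URL pattern is taken from the curl example above, the `quality_url` helper is hypothetical, and the response is assumed to be JSON (the actual response schema is not documented here).

```python
import json
import urllib.request

# Base URL from the curl example above; the "diffusion" path segment
# appears to be the repository's category slug (an assumption).
BASE = "https://pt-edge.onrender.com/api/v1/quality"

def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the per-repository quality endpoint URL (hypothetical helper)."""
    return f"{BASE}/{category}/{owner}/{repo}"

url = quality_url("diffusion", "huggingface", "finetrainers")

# Fetch and decode the response. No API key is needed for up to
# 100 requests/day; a free key raises the limit to 1,000/day.
# Uncomment to perform the actual network call:
# data = json.load(urllib.request.urlopen(url))
# print(data)
```

The network call is left commented out so the snippet stays side-effect free; in practice you would also set a timeout and handle HTTP errors.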
Higher-rated alternatives
NVlabs/Sana
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
FoundationVision/VAR
[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈]...
nerdyrodent/VQGAN-CLIP
Just playing with getting VQGAN+CLIP running locally, rather than having to use colab.
AssemblyAI-Community/MinImagen
MinImagen: A minimal implementation of the Imagen text-to-image model
eps696/aphantasia
CLIP + FFT/DWT/RGB = text to image/video