sail-sg/DiffMemorize

[TMLR 2025] On Memorization in Diffusion Models

Emerging

This project provides tools for researchers and machine learning engineers to investigate how and why diffusion models 'memorize' training data. It allows you to input image datasets like CIFAR-10 or ImageNet, train diffusion models under various conditions, and then measure the extent to which these models reproduce specific training examples rather than generating novel images. The outputs are metrics and generated images that reveal memorization patterns.
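
As a rough illustration of what quantifying memorization can look like, the sketch below flags a generated sample as memorized when its Euclidean distance to the nearest training example is much smaller than its distance to the second-nearest one, then reports the flagged fraction. The function names, the flattened-list image representation, and the 1/3 threshold are assumptions made for illustration; they are not taken from this repository's code.

```python
import math
from typing import Sequence

Vector = Sequence[float]  # a flattened image, e.g. pixel values in a list


def is_memorized(sample: Vector, training_set: Sequence[Vector],
                 threshold: float = 1 / 3) -> bool:
    """Nearest-neighbour ratio test (hypothetical helper).

    A sample counts as memorized when its distance to the closest
    training example is below `threshold` times its distance to the
    second-closest one. The default threshold is an assumption.
    """
    if len(training_set) < 2:
        raise ValueError("need at least two training examples")
    # Sort the Euclidean distances to every training example.
    distances = sorted(math.dist(sample, t) for t in training_set)
    nearest, second_nearest = distances[0], distances[1]
    return nearest < threshold * second_nearest


def memorization_ratio(generated: Sequence[Vector],
                       training_set: Sequence[Vector],
                       threshold: float = 1 / 3) -> float:
    """Fraction of generated samples flagged by `is_memorized`."""
    if not generated:
        raise ValueError("need at least one generated sample")
    flags = [is_memorized(g, training_set, threshold) for g in generated]
    return sum(flags) / len(flags)
```

Calling `memorization_ratio(generated, training_set)` on two lists of flattened images returns a value between 0 and 1. This brute-force version compares every generated sample with every training example, so on datasets the size of CIFAR-10 a vectorized or approximate nearest-neighbour search would likely be needed.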

No commits in the last 6 months.

Use this if you are a researcher or ML engineer studying the privacy implications, robustness, or generalization capabilities of diffusion models and need to empirically quantify memorization.

Not ideal if you are looking to simply train a diffusion model for image generation or apply one to a creative task, as this tool is focused on the analytical study of memorization.

diffusion-models model-auditing machine-learning-research generative-AI data-privacy

Stale 6m, No Package, No Dependents

Maintenance 0 / 25

Adoption 7 / 25

Maturity 16 / 25

Community 9 / 25

Language: Python

License: MIT

Higher-rated alternatives

quantgirluk/aleatory

📦 Python library for Stochastic Processes Simulation and Visualisation

blei-lab/treeffuser

Treeffuser is an easy-to-use package for probabilistic prediction and probabilistic regression...

TuftsBCB/RegDiffusion

Diffusion model for gene regulatory network inference.

yuanchenyang/smalldiffusion

Simple and readable code for training and sampling from diffusion models

chairc/Integrated-Design-Diffusion-Model

IDDM (Industrial, landscape, animate, latent diffusion), support LDM, DDPM, DDIM, PLMS, webui...
