wooyeolbaek/attention-map-diffusers

🚀 Cross attention map tools for huggingface/diffusers

/ 100

Established

This tool helps AI researchers and practitioners understand how text-to-image diffusion models interpret text prompts when generating images. It takes a text prompt and an image generation model (like Stable Diffusion 3) as input. It then visualizes which words in the prompt influence which regions of the generated image, outputting the results as attention maps. This is useful for debugging model behavior or tuning creative outputs.
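To make the idea of a per-word attention map concrete, here is a minimal, generic sketch of the cross-attention computation, written in plain Python. It is not this package's API: the function name `cross_attention_maps` and the toy vectors are hypothetical, and a real model computes this inside its attention layers across many heads and layers. The sketch only shows the core step, assuming image-patch queries and prompt-token keys: a softmax over tokens for each patch, regrouped into one 2D map per token.

```python
import math


def cross_attention_maps(queries, keys, height, width):
    """Return one height x width attention map per prompt token.

    queries: list of height*width image-patch vectors (each of length d)
    keys:    list of prompt-token vectors (each of length d)
    (Hypothetical helper for illustration; not part of any library.)
    """
    assert len(queries) == height * width
    d = len(keys[0])
    scale = 1.0 / math.sqrt(d)

    # For every image patch, take a softmax over the prompt tokens.
    weights = []
    for q in queries:
        scores = [scale * sum(qi * ki for qi, ki in zip(q, k)) for k in keys]
        m = max(scores)  # subtract the max for numerical stability
        exps = [math.exp(s - m) for s in scores]
        total = sum(exps)
        weights.append([e / total for e in exps])

    # Regroup: for each token, lay its per-patch weights out as a 2D grid.
    maps = []
    for t in range(len(keys)):
        flat = [weights[p][t] for p in range(len(queries))]
        maps.append([flat[r * width:(r + 1) * width] for r in range(height)])
    return maps


# Toy example: a 2x2 patch grid and a 2-token prompt (made-up vectors).
queries = [[1.0, 0.0], [1.0, 0.0], [0.0, 1.0], [0.0, 1.0]]
keys = [[4.0, 0.0], [0.0, 4.0]]
maps = cross_attention_maps(queries, keys, height=2, width=2)

for token_index, token_map in enumerate(maps):
    print(token_index, token_map)
```

In this toy setup the first token's map is concentrated on the top row of the grid and the second token's map on the bottom row, which is the kind of word-to-region correspondence an attention-map visualization is meant to expose.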

397 stars. Available on PyPI.

Use this if you need to visualize a text-to-image model's cross-attention to see how different words in your prompt contribute to specific elements in the generated image.

Not ideal if you are looking for a simple image generation tool without needing to delve into the underlying model mechanics.

Tags: AI research, text-to-image generation, model interpretability, prompt engineering, generative AI

Maintenance 10 / 25

Adoption 10 / 25

Maturity 25 / 25

Community 14 / 25

Stars: 397

Forks:

Language: Python

License: MIT

Related models

siliconflow/onediff

OneDiff: An out-of-the-box acceleration library for diffusion models.

jina-ai/discoart

🪩 Create Disco Diffusion artworks in one line

chengzeyi/stable-fast

https://wavespeed.ai/ Best inference performance optimization framework for HuggingFace...

hkproj/pytorch-stable-diffusion

Stable Diffusion implemented from scratch in PyTorch

explainingai-code/StableDiffusion-PyTorch

This repo implements a Stable Diffusion model in PyTorch with all the essential components.
