nihaljn/multimodal-prompting
Enabling the use of multiple modalities while prompting Stable Diffusion
This tool helps creative professionals such as artists, designers, and marketers generate unique images from a blend of text descriptions and reference images. You provide a prompt that mixes text with placeholders for images (e.g., "A tiger taking a walk on [img]"), specify the actual images, and set how much influence each image should have. The output is a new image that combines elements from your text and all provided visual references.
No commits in the last 6 months.
Use this if you want to generate images where you precisely control the output by combining conceptual text descriptions with specific visual styles or elements from existing images.
Not ideal if you prefer to generate images solely from text prompts or if you don't have specific reference images to guide your creation.
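To make the placeholder idea concrete, here is a minimal sketch of how a prompt like "A tiger taking a walk on [img]" might be split into ordered text fragments and weighted image slots before conditioning a diffusion model. The `PromptPart` class and `parse_prompt` helper are hypothetical illustrations, not this repository's actual API:

```python
from dataclasses import dataclass

@dataclass
class PromptPart:
    kind: str            # "text" or "image"
    value: str           # text fragment, or an image path for an image slot
    weight: float = 1.0  # how much influence this image has on the output

def parse_prompt(prompt, images, weights=None):
    """Split a prompt containing [img] placeholders into ordered parts.

    `images` supplies one path per placeholder; `weights` optionally sets
    each image's influence (defaulting to 1.0).
    """
    pieces = prompt.split("[img]")
    if len(pieces) - 1 != len(images):
        raise ValueError("number of [img] placeholders must match images")
    weights = weights or [1.0] * len(images)
    parts = []
    for i, text in enumerate(pieces):
        if text:
            parts.append(PromptPart("text", text))
        if i < len(images):
            parts.append(PromptPart("image", images[i], weights[i]))
    return parts

# Example: one image placeholder with 70% influence.
parts = parse_prompt("A tiger taking a walk on [img]", ["beach.png"], [0.7])
```

Each text part would then be encoded with the text encoder and each image part with an image encoder, with the weights scaling the image embeddings' contribution.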
Stars
15
Forks
3
Language
Python
License
—
Category
Last pushed
Oct 10, 2022
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/diffusion/nihaljn/multimodal-prompting"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
neggles/animatediff-cli
a CLI utility/library for AnimateDiff stable diffusion generation
sakalond/StableGen
Transform your 3D texturing workflow with the power of generative AI, directly within Blender!
victordibia/peacasso
UI interface for experimenting with multimodal (text, image) models (stable diffusion).
ai-forever/Kandinsky-2
Kandinsky 2 — multilingual text2image latent diffusion model
carefree0910/carefree-drawboard
🎨 Infinite Drawboard in Python