nihaljn/multimodal-prompting

Enabling the use of multiple modalities while prompting Stable Diffusion

28 / 100 Experimental

This tool helps creative professionals such as artists, designers, and marketers generate images from a blend of text descriptions and reference images. You provide a prompt that mixes text with image placeholders (e.g., "A tiger taking a walk on [img]"), supply the actual images, and specify how much influence each image should have. The output is a new image that combines elements from your text and all of the provided visual references.
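To make the placeholder idea concrete, here is a minimal sketch of how a mixed prompt could be split into ordered text and image segments before any model sees it. It is not taken from the repository: the names `parse_prompt`, `TextSegment`, and `ImageSegment`, and the convention of one weight per `[img]` placeholder, are assumptions made for illustration only.

```python
from dataclasses import dataclass
from typing import List, Sequence, Union

PLACEHOLDER = "[img]"  # placeholder token shown in the example prompt


@dataclass
class TextSegment:
    text: str


@dataclass
class ImageSegment:
    path: str      # hypothetical: path to the reference image
    weight: float  # hypothetical: influence of this image on the result


Segment = Union[TextSegment, ImageSegment]


def parse_prompt(prompt: str,
                 image_paths: Sequence[str],
                 weights: Sequence[float]) -> List[Segment]:
    """Split a prompt on PLACEHOLDER and interleave the given images."""
    parts = prompt.split(PLACEHOLDER)
    n_slots = len(parts) - 1  # number of placeholders found in the prompt
    if n_slots != len(image_paths) or n_slots != len(weights):
        raise ValueError(
            f"prompt has {n_slots} placeholder(s) but got "
            f"{len(image_paths)} image(s) and {len(weights)} weight(s)"
        )
    segments: List[Segment] = []
    for i, part in enumerate(parts):
        text = part.strip()
        if text:  # skip empty text between adjacent placeholders
            segments.append(TextSegment(text))
        if i < n_slots:  # an image follows every part except the last
            segments.append(ImageSegment(image_paths[i], weights[i]))
    return segments
```

For example, `parse_prompt("A tiger taking a walk on [img]", ["beach.jpg"], [0.5])` yields one text segment followed by one image segment; how the actual project turns such segments into conditioning for Stable Diffusion is not described here.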

No commits in the last 6 months.

Use this if you want to generate images where you precisely control the output by combining conceptual text descriptions with specific visual styles or elements from existing images.

Not ideal if you prefer to generate images solely from text prompts or if you don't have specific reference images to guide your creation.

digital-art concept-design creative-content-creation visual-asset-generation
No License, Stale 6m, No Package, No Dependents
Maintenance 0 / 25
Adoption 6 / 25
Maturity 8 / 25
Community 14 / 25

Stars: 15
Forks: 3
Language: Python
License: None
Last pushed: Oct 10, 2022
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/diffusion/nihaljn/multimodal-prompting"

Open to everyone: 100 requests/day, no key needed. A free key raises the limit to 1,000/day.
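The same request can be made from Python using only the standard library. This is a minimal sketch: the URL pattern is inferred from the single curl example above, the path segment `diffusion` is assumed to be a category name, and the response is assumed to be JSON. None of this is confirmed by API documentation, and the helper names `build_quality_url` and `fetch_quality` are hypothetical.

```python
import json
import urllib.parse
import urllib.request

# Base path copied from the curl example above.
API_BASE = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL; the category/owner/repo layout is an assumption."""
    parts = [urllib.parse.quote(p, safe="") for p in (category, owner, repo)]
    return f"{API_BASE}/{'/'.join(parts)}"


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the quality record and parse it, assuming the body is JSON."""
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.load(response)
```

Calling `fetch_quality("diffusion", "nihaljn", "multimodal-prompting")` performs one network request, which counts toward the daily limit mentioned above.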