gokayfem/ComfyUI_VLM_nodes

Custom ComfyUI nodes for Vision Language Models, Large Language Models, Image to Music, Text to Music, Consistent and Random Creative Prompt Generation

Score: 49 / 100 (Emerging)

This project offers tools within ComfyUI to let creative professionals, artists, or marketers easily generate music from images or text, and create detailed prompts for AI art. You can input an image to get music, or provide keywords/descriptions to generate consistent or creative text prompts. It's designed for anyone working with visual or textual content who wants to explore generative AI for new creative outputs or content variations.

563 stars.

Use this if you are a ComfyUI user looking to enhance your creative workflow with advanced Vision Language Models for tasks like generating music from visuals, extracting structured data from images, or intelligently crafting prompts for other generative AI models.

Not ideal if you are looking for a standalone application for music generation or prompt creation, as this project is specifically built as custom nodes within the ComfyUI ecosystem and requires familiarity with its interface.

AI Art, Music Generation, Creative Workflow, Prompt Engineering, Content Creation
No Package, No Dependents
Maintenance 6 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 17 / 25


Stars: 563
Forks: 59
Language: Python
License: Apache-2.0
Last pushed: Jan 11, 2026
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/gokayfem/ComfyUI_VLM_nodes"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
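
The same request can be made from a script. The following is a minimal Python sketch, not an official client: the helper names `quality_url` and `fetch_quality` are hypothetical, the path layout (category, owner, repository) is inferred from the curl URL above, and the response is assumed to be JSON.

```python
import json
from urllib.parse import quote
from urllib.request import urlopen

# Base URL copied from the curl example; the path layout is inferred from it.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL for one repository (hypothetical helper)."""
    # Percent-encode each path segment so unusual names cannot break the URL.
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return "/".join([BASE_URL, *parts])


def fetch_quality(url: str, timeout: float = 10.0):
    """GET the URL and decode the body, assuming the service returns JSON."""
    with urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_quality(quality_url("llm-tools", "gokayfem", "ComfyUI_VLM_nodes"))` issues the same request as the curl command. The response fields are not documented on this page, so inspect the returned object before relying on any particular key.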