asaddi/YALLM-LlamaVision

A set of nodes for basic Llama 3.2 Vision support in ComfyUI

/ 100

Emerging

This project helps you add image understanding to your creative workflows. You provide an image and a text query, and it generates a descriptive text response based on the image's content. It's designed for digital artists, designers, or content creators who use ComfyUI for generating and manipulating visual media and want to incorporate AI vision capabilities.

No commits in the last 6 months.

Use this if you are a ComfyUI user who wants to ask questions about an image and receive text-based answers directly within your visual workflow.

Not ideal if you need to integrate advanced AI vision into an application outside of ComfyUI or require extensive customization beyond basic image querying.

AI Art Generative Media Image Interpretation Creative Automation Visual Storytelling

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 4 / 25

Maturity 16 / 25

Community 13 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

—

Higher-rated alternatives

mishushakov/llm-scraper

Turn any webpage into structured data using LLMs

Mobile-Artificial-Intelligence/maid

Maid is a free and open source application for interfacing with llama.cpp models locally, and...

run-llama/LlamaIndexTS

Data framework for your LLM applications. Focus on server side solution

JHubi1/ollama-app

A modern and easy-to-use client for Ollama

serge-chat/serge

A web interface for chatting with Alpaca through llama.cpp. Fully dockerized, with an easy to use API.

Explore Transformer Models

All categories Trending Transformer directory Insights