asaddi/YALLM-LlamaVision
A set of nodes for basic Llama 3.2 Vision support in ComfyUI
This project helps you add image understanding to your creative workflows. You provide an image and a text query, and it generates a descriptive text response based on the image's content. It's designed for digital artists, designers, or content creators who use ComfyUI for generating and manipulating visual media and want to incorporate AI vision capabilities.
No commits in the last 6 months.
Use this if you are a ComfyUI user who wants to ask questions about an image and receive text-based answers directly within your visual workflow.
Not ideal if you need to integrate advanced AI vision into an application outside of ComfyUI or require extensive customization beyond basic image querying.
Stars
7
Forks
2
Language
Python
License
—
Category
Last pushed
Mar 27, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/asaddi/YALLM-LlamaVision"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
mishushakov/llm-scraper
Turn any webpage into structured data using LLMs
Mobile-Artificial-Intelligence/maid
Maid is a free and open source application for interfacing with llama.cpp models locally, and...
run-llama/LlamaIndexTS
Data framework for your LLM applications. Focus on server side solution
JHubi1/ollama-app
A modern and easy-to-use client for Ollama
serge-chat/serge
A web interface for chatting with Alpaca through llama.cpp. Fully dockerized, with an easy to use API.