lxe/llavavision
A simple "Be My Eyes" web app with a llama.cpp/llava backend
This web application acts like a personal assistant, describing what it "sees" through your device's camera and narrating the description aloud. It takes a live video feed or still images as input and provides an audio description of the visual content. This tool is designed for anyone who needs real-time visual assistance, similar to a 'Be My Eyes' service.
493 stars. No commits in the last 6 months.
Use this if you need an instant audio description of your surroundings or specific objects captured by your phone's camera, especially for tasks where visual information needs to be conveyed verbally.
Not ideal if you're looking for advanced image analysis, object recognition for complex scenarios, or a service that connects you with human volunteers.
Stars
493
Forks
33
Language
JavaScript
License
—
Category
Last pushed
Nov 28, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/lxe/llavavision"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
mishushakov/llm-scraper
Turn any webpage into structured data using LLMs
Mobile-Artificial-Intelligence/maid
Maid is a free and open source application for interfacing with llama.cpp models locally, and...
run-llama/LlamaIndexTS
Data framework for your LLM applications. Focus on server side solution
JHubi1/ollama-app
A modern and easy-to-use client for Ollama
serge-chat/serge
A web interface for chatting with Alpaca through llama.cpp. Fully dockerized, with an easy to use API.