nuhmanpk/VisionScriptBot
A Telegram bot that uses Google's Gemini Pro Vision API to convert images to text
This Telegram bot helps you quickly get text descriptions or answers from images. You upload an image to the bot, optionally with a text prompt as a caption, and it responds with relevant text. It's ideal for anyone who needs to extract information or generate descriptions from visual content like photos, documents, charts, or screenshots.
No commits in the last 6 months.
Use this if you need to quickly understand, summarize, or extract specific details from an image by simply sending it to a Telegram bot.
Not ideal if you need a solution for video analysis or if you prefer a standalone application over a chat-based interface.
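The flow described above (an image plus an optional caption used as the prompt) can be sketched roughly as follows. This is a minimal illustration, not the bot's actual code: `VisionRequest`, `build_request`, and `DEFAULT_PROMPT` are hypothetical names, and the fallback prompt text is an assumption, since the listing does not say what the bot does when no caption is sent.

```python
from dataclasses import dataclass
from typing import Optional

# Hypothetical fallback prompt; the bot's real default is not given in the listing.
DEFAULT_PROMPT = "Describe this image."


@dataclass
class VisionRequest:
    """What would be handed to a vision model: a text prompt and raw image bytes."""
    prompt: str
    image: bytes


def build_request(image: bytes, caption: Optional[str] = None) -> VisionRequest:
    """Pair the image with its caption, or with the fallback prompt if no caption was sent."""
    if not image:
        raise ValueError("an image is required")
    # Treat a missing or whitespace-only caption as "no prompt given".
    cleaned = caption.strip() if caption else ""
    prompt = cleaned if cleaned else DEFAULT_PROMPT
    return VisionRequest(prompt=prompt, image=image)
```

A caption such as "What is the total on this receipt?" would be passed through as the prompt, while an image sent on its own would fall back to the assumed default.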
Stars: 23
Forks: 9
Language: Python
License: MIT
Category:
Last pushed: Jul 30, 2024
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/nuhmanpk/VisionScriptBot"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
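The same request can be made from Python using only the standard library. This is a sketch under assumptions: the helper names `build_quality_url` and `fetch_quality` are made up for illustration, the split of the path into a category segment (`llm-tools`) and an `owner/name` segment is inferred from the single URL shown above, and the response is assumed to be JSON, which the listing does not confirm.

```python
import json
import urllib.request

# Base path taken from the curl example; everything after it is assumed to be
# "<category>/<owner>/<repo>".
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, repo: str) -> str:
    """Build the endpoint URL for a repo given as 'owner/name'."""
    return f"{BASE_URL}/{category}/{repo}"


def fetch_quality(category: str, repo: str, timeout: float = 10.0):
    """GET the endpoint and parse the body as JSON (assumed response format)."""
    url = build_quality_url(category, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

Calling `fetch_quality("llm-tools", "nuhmanpk/VisionScriptBot")` would issue the same GET as the curl command. With the keyless limit of 100 requests/day, it is worth caching the result rather than fetching on every run.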
Higher-rated alternatives
HanaokaYuzu/Gemini-API
✨ Reverse-engineered Python API for Google Gemini web app
hihumanzone/Gemini-Discord-Bot
A Discord bot leveraging Google Gemini. Has image/video/audio recognition, conversation...
faetalize/zodiac
A ⚡lightweight⚡ frontend for Google's Gemini Pro.
Amm1rr/WebAI-to-API
Gemini to API (no API key needed) (ChatGPT, Claude, DeepSeek, Grok and more)
AOrbitron/Eridanus
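A multifunctional bot and development framework based on the OneBot protocol. It builds a smarter function-invocation mechanism with LLM function calling at its core.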