nuhmanpk/VisionScriptBot

A Telegram bot that uses Google's Gemini Pro Vision API to convert images to text


Emerging

This Telegram bot helps you quickly get text descriptions or answers from images. You upload an image to the bot, optionally with a text prompt as a caption, and it responds with relevant text. It's ideal for anyone who needs to extract information or generate descriptions from visual content like photos, documents, charts, or screenshots.
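The flow described above (an image arrives, an optional caption acts as the prompt, and text comes back) can be sketched roughly as follows. This is a minimal illustration, not the repository's actual code: `VisionRequest`, `build_request`, `handle_photo`, `fake_model`, and `DEFAULT_PROMPT` are all hypothetical names, and the real Gemini call is replaced by an injected callable so the sketch runs offline.

```python
from dataclasses import dataclass
from typing import Callable, Optional

# Hypothetical fallback prompt; the bot's real default wording is not stated in the source.
DEFAULT_PROMPT = "Describe this image."


@dataclass
class VisionRequest:
    """One image plus the text prompt that accompanies it."""
    prompt: str
    image: bytes


def build_request(image: bytes, caption: Optional[str] = None) -> VisionRequest:
    """Pair the uploaded image with its caption, or with a default prompt if none was sent."""
    if not image:
        raise ValueError("an image is required")
    prompt = caption.strip() if caption and caption.strip() else DEFAULT_PROMPT
    return VisionRequest(prompt=prompt, image=image)


def handle_photo(
    image: bytes,
    caption: Optional[str],
    model: Callable[[VisionRequest], str],
) -> str:
    """Run one image message through the model and return the reply text."""
    request = build_request(image, caption)
    reply = model(request).strip()
    # Fall back to a fixed message when the model returns nothing usable.
    return reply or "No text could be generated for this image."


def fake_model(request: VisionRequest) -> str:
    """Stand-in for the vision model call (assumption: the real bot calls Gemini here)."""
    return f"[{len(request.image)} bytes] {request.prompt}"


if __name__ == "__main__":
    print(handle_photo(b"\x89PNG", "What does the chart show?", fake_model))
    print(handle_photo(b"\x89PNG", None, fake_model))
```

Passing the model in as a callable keeps the caption-handling logic separate from the network call, so a real client could presumably be swapped in for `fake_model` without changing the rest.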

No commits in the last 6 months.

Use this if you need to quickly understand, summarize, or extract specific details from an image by simply sending it to a Telegram bot.

Not ideal if you need a solution for video analysis or if you prefer a standalone application over a chat-based interface.

image-to-text document-analysis visual-search content-understanding information-extraction

Stale 6m · No Package · No Dependents

Maintenance 0 / 25

Adoption 6 / 25

Maturity 16 / 25

Community 17 / 25

Language: Python

License: MIT

Higher-rated alternatives

HanaokaYuzu/Gemini-API

✨ Reverse-engineered Python API for Google Gemini web app

hihumanzone/Gemini-Discord-Bot

A Discord bot leveraging Google Gemini. Has image/video/audio recognition, conversation...

faetalize/zodiac

A ⚡lightweight⚡ frontend for Google's Gemini Pro.

Amm1rr/WebAI-to-API

Gemini to API (no API key needed) (ChatGPT, Claude, DeepSeek, Grok and more)

AOrbitron/Eridanus

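A multi-purpose bot and development framework based on the OneBot protocol. It builds a smarter function-invocation mechanism with LLM function calling at its core.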
