eniompw/GeminiOCR

Gemini API for OCR

/ 100

Emerging

This tool helps you quickly extract text from images, like scanned documents, photos of signs, or screenshots. You provide an image, and it returns the text found within it, saving you from manually typing. It's designed for anyone who needs to convert visual information into editable text.

Use this if you frequently need to get text out of images or scanned documents without retyping it manually.

Not ideal if you need highly accurate text extraction from complex layouts, handwritten notes, or documents in many different languages, as this is a simple demo.

document-management data-entry information-capture text-extraction

No Package No Dependents

Maintenance 6 / 25

Adoption 6 / 25

Maturity 16 / 25

Community 14 / 25

How are scores calculated?

Stars

Forks

Language

HTML

License

MIT

Higher-rated alternatives

HanaokaYuzu/Gemini-API

✨ Reverse-engineered Python API for Google Gemini web app

faetalize/zodiac

A ⚡lightweight⚡ frontend for Google's Gemini Pro.

hihumanzone/Gemini-Discord-Bot

A Discord bot leveraging Google Gemini. Has image/video/audio recognition, conversation...

Amm1rr/WebAI-to-API

Gemini to API (Don't need API KEY) (ChatGPT, Claude, DeeepSeek, Grok and more)

GewoonJaap/gemini-cli-openai

Expose Gemini CLI endpoints as OpenAI API with Cloudflare Workers

Explore LLM Tools

All categories Trending LLM Tool directory Insights