skitsanos/gemini-ocr

PDF Screenshot OCR Analysis with Google Gemini Pro

This project converts screenshots of PDF pages into editable text. It takes the screenshots as input, uses Google Gemini Pro to extract the text, and produces a clean text file plus a JSON output for further processing. It suits anyone who needs to get text out of image-based PDFs, such as administrative staff, researchers, or data entry specialists.

Use this if you need to automate extracting text from many PDF screenshots for tasks like data analysis, document digitization, or making content accessible.

Not ideal if you need to extract text directly from machine-readable PDFs or if you require highly structured data extraction beyond plain text.
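
The project's own scripts are not shown in this listing, so the following is only a minimal sketch of the flow described above (screenshot in, text file and JSON out), not the repository's implementation. It assumes the request body shape used by the Gemini `generateContent` REST API (a text part plus a base64 `inline_data` part). The prompt text, the function names `build_ocr_request` and `save_outputs`, and the JSON record layout are hypothetical. The network call itself is left out because it needs an API key.

```python
import base64
import json
from pathlib import Path
from typing import Tuple

# Hypothetical prompt; the project's real prompt is not shown in this listing.
DEFAULT_PROMPT = "Extract all text from this PDF screenshot. Return plain text only."


def build_ocr_request(image_bytes: bytes, mime_type: str = "image/png",
                      prompt: str = DEFAULT_PROMPT) -> dict:
    """Build a request body with one text part and one inline image part."""
    encoded = base64.b64encode(image_bytes).decode("ascii")
    return {
        "contents": [{
            "parts": [
                {"text": prompt},
                {"inline_data": {"mime_type": mime_type, "data": encoded}},
            ]
        }]
    }


def save_outputs(text: str, source_name: str, out_dir: Path) -> Tuple[Path, Path]:
    """Write the extracted text as a .txt file and as a small JSON record."""
    out_dir.mkdir(parents=True, exist_ok=True)
    stem = Path(source_name).stem
    txt_path = out_dir / f"{stem}.txt"
    json_path = out_dir / f"{stem}.json"
    txt_path.write_text(text, encoding="utf-8")
    # Assumed record layout: source file name plus the extracted text.
    record = {"source": source_name, "text": text}
    json_path.write_text(json.dumps(record, ensure_ascii=False, indent=2),
                         encoding="utf-8")
    return txt_path, json_path
```

In a real run, the body returned by `build_ocr_request` would be posted to the Gemini API, and the text from the response would be passed to `save_outputs` once per screenshot.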

document-digitization data-extraction text-analysis content-accessibility research-support

No License · No Package · No Dependents

Maintenance 10 / 25

Adoption 5 / 25

Maturity 8 / 25

Community 11 / 25

Language: Shell

Higher-rated alternatives

vsion-x/vizgenie

Turn natural language into Grafana dashboards. Powered by AI, VizGenie auto-generates PromQL...

tanaikech/FileSearchStore-extension

This repository introduces a Gemini CLI extension that integrates File Search feature. This tool...

vojay-dev/gemini-movie-detectives-api

Use Gemini Pro LLM via VertexAI to create an engaging quiz game incorporating TMDB API data

g-hano/Smarty-Gemini

It is a sophisticated agent designed to meet a wide range of user needs through advanced...

ssabrut/gemini-data-analysis

This project leverages Google Gemini Pro, Retrieval-Augmented Generation (RAG), and Streamlit to...
