samestrin/llm-pdf-ocr-api
A Python-based REST API, run in a Docker container, that performs OCR on PDFs using AI models built with PyTorch and Transformers.
This tool helps businesses and researchers convert scanned PDF documents into editable, searchable text. You submit a PDF file, and it uses AI to extract all the text, even from images, delivering it back as a text document. This is ideal for data entry specialists, librarians, or anyone needing to digitize information from physical or scanned documents.
No commits in the last 6 months.
Use this if you need to reliably extract text from image-based PDF documents that regular copy-pasting cannot handle.
Not ideal if you primarily work with digitally native PDFs where text can already be easily selected and copied.
Stars: 34
Forks: 10
Language: Python
License: MIT
Category:
Last pushed: May 17, 2024
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/samestrin/llm-pdf-ocr-api"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
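For scripted access, the same request can be sketched in Python using only the standard library. This is a minimal sketch under stated assumptions: the helper names `quality_url` and `fetch_quality` are hypothetical, the reading of the three path segments as category, owner, and repository is inferred from the curl URL above, and the response is assumed (not confirmed by this page) to be JSON.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl command on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL.

    Assumption: the path is <category>/<owner>/<repo>, inferred from the
    single example URL shown on this page.
    """
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return "/".join([BASE_URL, *parts])


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the endpoint and decode the body.

    Assumption: the body is JSON; the page does not document the format.
    """
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


if __name__ == "__main__":
    # Prints the URL only; call fetch_quality(...) to make the request.
    print(quality_url("transformers", "samestrin", "llm-pdf-ocr-api"))
```

Calling `fetch_quality("transformers", "samestrin", "llm-pdf-ocr-api")` issues the request itself and counts against the daily request limit stated above.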
Higher-rated alternatives
clusterzx/paperless-ai
An automated document analyzer for Paperless-ngx using OpenAI API, Ollama, Deepseek-r1, Azure...
kha-white/manga-ocr
Optical character recognition for Japanese text, with the main focus being Japanese manga
alephpi/Texo-web
The web application for Texo, a minimalist SOTA LaTeX OCR model which contains only 20M...
bytefer/ollama-ocr
Implementing OCR with a local visual model run by ollama.
alephpi/Texo
A minimalist SOTA LaTeX OCR model with only 20M parameters, running in browser. Full training...