samestrin/llm-pdf-ocr-api

A Python-based REST API, packaged to run in a Docker container, that performs OCR on PDFs using AI models built on PyTorch and Transformers.

Emerging

This tool helps businesses and researchers convert scanned PDF documents into editable, searchable text. You submit a PDF file, and it uses AI to extract all the text, even from images, delivering it back as a text document. This is ideal for data entry specialists, librarians, or anyone needing to digitize information from physical or scanned documents.
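As a rough illustration of the "submit a PDF file" step, the sketch below builds a multipart upload request using only the Python standard library. The URL `http://localhost:8000/ocr`, the form field name `file`, and the helper name `build_ocr_request` are all assumptions made for illustration; they are not taken from the project's documentation, so check the repository's README for the actual endpoint and parameters.

```python
import uuid
import urllib.request


def build_ocr_request(
    pdf_bytes: bytes,
    filename: str,
    url: str = "http://localhost:8000/ocr",  # hypothetical endpoint
    field: str = "file",                     # hypothetical form field name
) -> urllib.request.Request:
    """Build (but do not send) a multipart/form-data POST carrying one PDF."""
    # A random boundary separates the parts of the multipart body.
    boundary = uuid.uuid4().hex
    head = (
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="{field}"; filename="{filename}"\r\n'
        "Content-Type: application/pdf\r\n\r\n"
    ).encode("utf-8")
    tail = f"\r\n--{boundary}--\r\n".encode("utf-8")
    body = head + pdf_bytes + tail
    return urllib.request.Request(
        url,
        data=body,
        method="POST",
        headers={"Content-Type": f"multipart/form-data; boundary={boundary}"},
    )
```

With the container running, the request could be sent with `urllib.request.urlopen(req)` and the response body read as the extracted text. The response format is also an assumption here.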

No commits in the last 6 months.

Use this if you need to reliably extract text from image-based PDF documents that regular copy-pasting cannot handle.

Not ideal if you primarily work with digitally native PDFs where text can already be easily selected and copied.

Tags: document-digitization, data-extraction, content-conversion, information-retrieval, pdf-processing

Stale 6m, No Package, No Dependents

Maintenance 0 / 25

Adoption 7 / 25

Maturity 16 / 25

Community 17 / 25

Language: Python

License: MIT

Higher-rated alternatives

clusterzx/paperless-ai

An automated document analyzer for Paperless-ngx using OpenAI API, Ollama, Deepseek-r1, Azure...

kha-white/manga-ocr

Optical character recognition for Japanese text, with the main focus being Japanese manga

alephpi/Texo-web

The web application for Texo, a minimalist SOTA LaTeX OCR model which contains only 20M...

bytefer/ollama-ocr

Implementing OCR with a local visual model run by ollama.

alephpi/Texo

A minimalist SOTA LaTeX OCR model with only 20M parameters, running in browser. Full training...
