ocrmypdf/OCRmyPDF
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
33,179 stars. Actively maintained with 23 commits in the last 30 days.
Stars
33,179
Forks
2,300
Language
Python
License
MPL-2.0
Last pushed
Apr 08, 2026
Commits (30d)
23
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/document-ai/ocrmypdf/OCRmyPDF"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
opendatalab/MinerU
Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.
mehmet-kozan/pdf-parse
Pure TypeScript, cross-platform module for extracting text, images, and tabular data from PDFs....
HIllya51/LunaTranslator
视觉小说翻译器 / Visual Novel Translator
ShareX/ShareX
ShareX is a free and open-source application that enables users to capture or record any area of...
btwld/docling-sdk
A TypeScript SDK for Docling - Bridge between the Python Docling ecosystem and JavaScript/TypeScript.