opendatalab/MinerU
Transforms complex documents like PDFs into LLM-ready Markdown/JSON for your agentic workflows.
59,166 stars. Used by 1 other package. Actively maintained with 260 commits in the last 30 days. Available on PyPI.
Stars: 59,166
Forks: 4,913
Language: Python
License: AGPL-3.0
Last pushed: Apr 10, 2026
Commits (30d): 260
Dependencies: 34
Reverse dependents: 1
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/document-ai/opendatalab/MinerU"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
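The same request can be made from Python. The sketch below is a minimal illustration using only the standard library. It assumes the endpoint returns JSON, which this page does not state, and it assumes the path follows the category/owner/repo pattern seen in the curl example. The helper names `build_quality_url` and `fetch_quality` are hypothetical, not part of any published client.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL, assuming a category/owner/repo path layout."""
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return f"{BASE_URL}/{'/'.join(parts)}"


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the record for one repository.

    Assumption: the response body is JSON. If it is not, json.loads raises
    a ValueError and the raw body should be inspected instead.
    """
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


if __name__ == "__main__":
    # Prints the URL only; call fetch_quality(...) to perform the request.
    print(build_quality_url("document-ai", "opendatalab", "MinerU"))
```

Keeping URL construction separate from the network call makes it easy to check the path before spending any of the daily request quota.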
Related tools
mehmet-kozan/pdf-parse
Pure TypeScript, cross-platform module for extracting text, images, and tabular data from PDFs....
HIllya51/LunaTranslator
Visual Novel Translator
ShareX/ShareX
ShareX is a free and open-source application that enables users to capture or record any area of...
btwld/docling-sdk
A TypeScript SDK for Docling - Bridge between the Python Docling ecosystem and JavaScript/TypeScript.
STranslate/STranslate
A ready-to-go translation and OCR tool developed with WPF.