felixdittrich92/docling-OCR-OnnxTR

OnnxTR OCR plugin for Docling

41
/ 100
Emerging

This tool helps convert PDFs and other document images into editable text. It uses advanced optical character recognition (OCR) to accurately extract text from your documents, even if they're complex or scanned. The output is structured text that can be easily searched, copied, and integrated into other systems, making it ideal for researchers, data entry specialists, or anyone dealing with large volumes of digital documents.

Available on PyPI.

Use this if you need to quickly and accurately extract text from scanned documents, images, or PDFs to make them searchable or editable.

Not ideal if you primarily need to process handwritten notes or highly stylized text, as its focus is on efficiency and standard document layouts.

document-processing data-extraction digital-archiving research-automation
Maintenance 10 / 25
Adoption 6 / 25
Maturity 25 / 25
Community 0 / 25

How are scores calculated?

Stars

17

Forks

Language

Python

License

Apache-2.0

Last pushed

Mar 01, 2026

Commits (30d)

0

Dependencies

2

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/felixdittrich92/docling-OCR-OnnxTR"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.