felixdittrich92/OnnxTR

OnnxTR a docTR (Document Text Recognition) library Onnx pipeline wrapper - for seamless, high-performing & accessible OCR

60
/ 100
Established

This tool helps you convert scanned documents, images, or even web pages into editable and searchable text. You provide documents in formats like PDF or JPG, and it gives you back the extracted text, intelligently grouped into words and lines. It's designed for anyone who needs to quickly and accurately get text out of various document types.

176 stars. Used by 1 other package. Available on PyPI.

Use this if you need to extract text efficiently from documents, images, or web pages, especially if you prioritize performance and resource efficiency over needing to customize the core OCR models directly.

Not ideal if you need to fine-tune the underlying OCR models using deep learning frameworks like PyTorch or TensorFlow, as this project is a wrapper focused on deployment.

document-processing data-entry information-extraction digital-archiving text-digitization
Maintenance 10 / 25
Adoption 11 / 25
Maturity 25 / 25
Community 14 / 25

How are scores calculated?

Stars

176

Forks

18

Language

Python

License

Apache-2.0

Last pushed

Mar 09, 2026

Commits (30d)

0

Dependencies

11

Reverse dependents

1

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/felixdittrich92/OnnxTR"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.