felixdittrich92/OnnxTR

OnnxTR a docTR (Document Text Recognition) library Onnx pipeline wrapper - for seamless, high-performing & accessible OCR

/ 100

Established

This tool helps you convert scanned documents, images, or even web pages into editable and searchable text. You provide documents in formats like PDF or JPG, and it gives you back the extracted text, intelligently grouped into words and lines. It's designed for anyone who needs to quickly and accurately get text out of various document types.

176 stars. Used by 1 other package. Available on PyPI.

Use this if you need to extract text efficiently from documents, images, or web pages, especially if you prioritize performance and resource efficiency over needing to customize the core OCR models directly.

Not ideal if you need to fine-tune the underlying OCR models using deep learning frameworks like PyTorch or TensorFlow, as this project is a wrapper focused on deployment.

document-processing data-entry information-extraction digital-archiving text-digitization

Maintenance 10 / 25

Adoption 11 / 25

Maturity 25 / 25

Community 14 / 25

How are scores calculated?

Stars

176

Forks

Language

Python

License

Apache-2.0

Compare

OnnxTR and doctr

Related frameworks

JaidedAI/EasyOCR

Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin,...

breezedeus/CnSTD

CnSTD: 基于 PyTorch/MXNet 的中文/英文场景文字检测（Scene Text Detection）、数学公式检测（Mathematical Formula...

githubharald/SimpleHTR

Handwritten Text Recognition (HTR) system implemented with TensorFlow.

mindee/doctr

docTR (Document Text Recognition) - a seamless, high-performing & accessible library for...

parlance/ctcdecode

PyTorch CTC Decoder bindings

Explore ML Frameworks

All categories Trending ML Framework directory Insights