robbyzhaox/myocr
A highly extensible and customizable framework for building OCR systems.
This framework lets engineers build and customize Optical Character Recognition (OCR) systems. It takes images or documents as input and extracts plain text or structured data, such as invoice details. It is aimed at developers and MLOps engineers who need specialized OCR pipelines for specific business needs.
Use this if you are an engineer tasked with building a robust, custom OCR solution that needs to handle specific document types or integrate deep learning models.
Not ideal if you just need a simple, off-the-shelf OCR tool to quickly extract text from common documents without any customization or deep integration.
Stars: 287
Forks: 13
Language: Python
License: Apache-2.0
Category:
Last pushed: Oct 27, 2025
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/robbyzhaox/myocr"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
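The same request can be made from Python. The following is a minimal sketch using only the standard library. The base URL is taken from the curl command above; the `category/owner/repo` path layout is inferred from that single example, the helper names `build_url` and `fetch_quality` are hypothetical, and the assumption that the endpoint returns JSON is not confirmed by this page.

```python
import json
import urllib.request
from urllib.parse import quote

# Base URL copied from the curl command above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL for one repository.

    The category/owner/repo layout is inferred from the single
    example URL on this page, so treat it as an assumption.
    """
    # Percent-encode each path segment so unusual characters stay valid.
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the record for one repository.

    Assumes the response body is JSON; this is not confirmed by the page.
    """
    request = urllib.request.Request(
        build_url(category, owner, repo),
        headers={"Accept": "application/json"},
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_quality("ml-frameworks", "robbyzhaox", "myocr")` would issue the same GET request as the curl command. With no key, the 100 requests/day limit applies, so caching the result locally is a sensible default.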
Higher-rated alternatives
JaidedAI/EasyOCR
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin,...
breezedeus/CnSTD
CnSTD: PyTorch/MXNet-based Chinese/English Scene Text Detection, Mathematical Formula...
githubharald/SimpleHTR
Handwritten Text Recognition (HTR) system implemented with TensorFlow.
felixdittrich92/OnnxTR
OnnxTR a docTR (Document Text Recognition) library Onnx pipeline wrapper - for seamless,...
mindee/doctr
docTR (Document Text Recognition) - a seamless, high-performing & accessible library for...