mindspore-lab/mindocr

A toolbox of ocr models and algorithms based on MindSpore

59
/ 100
Established

This tool helps businesses and researchers automatically extract text from images and documents. You input images containing text, and it outputs the detected text and its location, making it easier to digitize information. It's designed for anyone needing to process visual data to get at the embedded text, such as for data entry, archival, or content analysis.

299 stars. No commits in the last 6 months. Available on PyPI.

Use this if you need to reliably detect and recognize text within various images and integrate this capability into your applications.

Not ideal if you primarily need to extract data from highly structured forms or specific document types with complex layouts, as it focuses on general text recognition.

document-digitization data-extraction image-processing content-management information-retrieval
Stale 6m
Maintenance 2 / 25
Adoption 10 / 25
Maturity 25 / 25
Community 22 / 25

How are scores calculated?

Stars

299

Forks

62

Language

Python

License

Apache-2.0

Category

latex-ocr-tools

Last pushed

Jul 24, 2025

Commits (30d)

0

Dependencies

27

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/mindspore-lab/mindocr"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.