mindspore-lab/mindocr

A toolbox of ocr models and algorithms based on MindSpore

/ 100

Established

This tool helps businesses and researchers automatically extract text from images and documents. You input images containing text, and it outputs the detected text and its location, making it easier to digitize information. It's designed for anyone needing to process visual data to get at the embedded text, such as for data entry, archival, or content analysis.

299 stars. No commits in the last 6 months. Available on PyPI.

Use this if you need to reliably detect and recognize text within various images and integrate this capability into your applications.

Not ideal if you primarily need to extract data from highly structured forms or specific document types with complex layouts, as it focuses on general text recognition.

document-digitization data-extraction image-processing content-management information-retrieval

Stale 6m

Maintenance 2 / 25

Adoption 10 / 25

Maturity 25 / 25

Community 22 / 25

How are scores calculated?

Stars

299

Forks

Language

Python

License

Apache-2.0

Compare

mindocr and mmocr

Related frameworks

ogkalu2/comic-translate

Desktop app for automatically translating comics - BDs, Manga, Manhwa, Fumetti and more in a...

naptha/tesseract.js

Pure Javascript OCR for more than 100 Languages 📖🎉🖥

mayocream/koharu

ML-powered manga translator, written in Rust.

tesseract-ocr/tesseract

Tesseract Open Source OCR Engine (main repository)

zyddnys/manga-image-translator

Translate manga/image 一键翻译各类图片内文字 https://cotrans.touhou.ai/ (no longer working)

Explore ML Frameworks

All categories Trending ML Framework directory Insights