maxent-ai/ocrpy

OCR, Archive, Index and Search: Implementation agnostic OCR framework.

44
/ 100
Emerging

This tool helps you convert scanned documents or images containing text into searchable and editable text files, regardless of whether they are stored locally or in cloud services like AWS or Google Cloud. It takes various document types as input and provides indexed, searchable text as output. This is ideal for anyone managing large volumes of documents, such as a records manager, legal professional, or data entry specialist.

224 stars. No commits in the last 6 months. Available on PyPI.

Use this if you need a straightforward way to extract text from images and documents using different OCR technologies without having to learn each one's specific interface.

Not ideal if you only occasionally process a few documents or prefer to manually type out text rather than automate the process.

document-management data-entry information-retrieval digital-archiving text-extraction
Stale 6m
Maintenance 0 / 25
Adoption 10 / 25
Maturity 25 / 25
Community 9 / 25

How are scores calculated?

Stars

224

Forks

11

Language

Jupyter Notebook

License

MIT

Last pushed

Nov 03, 2023

Commits (30d)

0

Dependencies

17

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/embeddings/maxent-ai/ocrpy"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.