emedvedev/attention-ocr

A Tensorflow model for text recognition (CNN + seq2seq with visual attention) available as a Python package and compatible with Google Cloud ML Engine.

51
/ 100
Established

This project helps convert text found within images into editable digital text. You provide it with images containing text and an annotation file that labels the text in each image. The system then processes these images to extract the text, which can be used for further analysis or integration into other systems. It's designed for machine learning engineers or data scientists who need to build and deploy custom optical character recognition (OCR) models.

1,085 stars. No commits in the last 6 months.

Use this if you need to train a specialized OCR model on your own unique image datasets and deploy it for real-time text extraction.

Not ideal if you're looking for an out-of-the-box OCR solution for common tasks without needing to train a custom model.

optical-character-recognition document-automation image-text-extraction machine-learning-deployment data-labeling
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 25 / 25

How are scores calculated?

Stars

1,085

Forks

252

Language

Python

License

MIT

Last pushed

Oct 20, 2023

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/emedvedev/attention-ocr"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.