bhattbhavesh91/DocTR-OCR-tutorial
This repository contains a notebook that demonstrates the Document Text Recognition (docTR) library.
This project helps you extract text from scanned documents, images, and PDFs. It takes an image or PDF file as input and outputs the text content found within it. This is useful for anyone who needs to convert physical documents or images of text into editable or searchable digital text, such as data entry clerks, researchers, or archivists.
No commits in the last 6 months.
Use this if you need to quickly and accurately digitize text from images or non-searchable PDF documents.
Not ideal if you primarily work with already searchable digital text documents or require highly specialized handwriting recognition.
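The image-or-PDF-in, text-out flow described above can be sketched with docTR's documented entry points (`DocumentFile`, `ocr_predictor`, and `render()`). This is a minimal sketch, not the notebook's own code: the helper names `loader_name` and `extract_text` and the list of accepted image suffixes are assumptions made for illustration, and the pretrained weights are downloaded the first time the predictor is built.

```python
from pathlib import Path

# Assumed set of image suffixes for this sketch; adjust to your inputs.
IMAGE_SUFFIXES = {".png", ".jpg", ".jpeg"}


def loader_name(path):
    """Return the name of the DocumentFile constructor suited to the file type."""
    suffix = Path(path).suffix.lower()
    if suffix == ".pdf":
        return "from_pdf"
    if suffix in IMAGE_SUFFIXES:
        return "from_images"
    raise ValueError(f"unsupported file type: {suffix!r}")


def extract_text(path):
    """Run OCR on one image or PDF and return the recognized text."""
    # Imported lazily so loader_name() works without docTR installed.
    from doctr.io import DocumentFile
    from doctr.models import ocr_predictor

    # Pick DocumentFile.from_pdf or DocumentFile.from_images and load the pages.
    pages = getattr(DocumentFile, loader_name(path))(str(path))
    # Build the default detection + recognition pipeline with pretrained weights.
    model = ocr_predictor(pretrained=True)
    result = model(pages)
    # render() flattens the structured result into plain text.
    return result.render()
```

Calling `extract_text("scanned_invoice.pdf")` (a hypothetical file name) would return the recognized text as one string. When several files are processed, building the predictor once and reusing it avoids reloading the model for each file.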
Stars: 15
Forks: 5
Language: Jupyter Notebook
License: Apache-2.0
Category:
Last pushed: Aug 24, 2021
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/bhattbhavesh91/DocTR-OCR-tutorial"
Open to everyone: 100 requests/day, no key needed. Get a free key for 1,000/day.
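The same request can be made from Python with the standard library. This is a sketch under two assumptions that the listing does not confirm: that the URL path is laid out as category/owner/repo (inferred from the single curl example), and that the endpoint responds with JSON. The function names `quality_url` and `fetch_quality` are hypothetical.

```python
import json
from urllib.parse import quote
from urllib.request import urlopen

BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category, owner, repo):
    """Build the endpoint URL; the category/owner/repo layout is an assumption."""
    parts = (category, owner, repo)
    return BASE_URL + "/" + "/".join(quote(p, safe="") for p in parts)


def fetch_quality(category, owner, repo, timeout=10):
    """GET the endpoint and decode the body, assuming it is JSON."""
    with urlopen(quality_url(category, owner, repo), timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

For example, `fetch_quality("ml-frameworks", "bhattbhavesh91", "DocTR-OCR-tutorial")` requests the same URL as the curl command. Unauthenticated calls count against the 100 requests/day limit.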
Higher-rated alternatives
JaidedAI/EasyOCR
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin,...
breezedeus/CnSTD
CnSTD: PyTorch/MXNet-based Chinese/English scene text detection (Scene Text Detection), mathematical formula detection (Mathematical Formula...
githubharald/SimpleHTR
Handwritten Text Recognition (HTR) system implemented with TensorFlow.
felixdittrich92/OnnxTR
OnnxTR a docTR (Document Text Recognition) library Onnx pipeline wrapper - for seamless,...
mindee/doctr
docTR (Document Text Recognition) - a seamless, high-performing & accessible library for...