KadenMc/PreprocessingHTR
Pre-processing a handwritten page into word images for Handwritten Text Recognition (HTR).
This tool helps researchers, historians, and archivists convert scanned images of handwritten pages into individual word images, making them ready for Handwritten Text Recognition (HTR) systems. You input a full, clear image of a handwritten page, and it outputs separate images for each word found on the page. It's designed for anyone working with historical documents or large collections of handwritten text who needs to digitize content.
No commits in the last 6 months.
Use this if you need to prepare scanned handwritten documents for automated text recognition, specifically by extracting individual word images.
Not ideal if your handwritten pages are heavily warped, have overlapping text lines, or have imperfect lighting or page borders, as it relies on clear page structure.
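To make the page-to-word-images idea concrete, here is a minimal sketch of one common segmentation approach, projection profiles, on a tiny binarized page. It is not taken from PreprocessingHTR, whose actual method is not described here. The names `runs`, `segment_words`, and `word_gap` are hypothetical, and the sketch assumes the page is already binarized (1 = ink, 0 = background) with straight, non-overlapping lines.

```python
from typing import List, Tuple

Image = List[List[int]]  # assumed binarized page: 1 = ink, 0 = background


def runs(profile: List[int], min_gap: int = 1) -> List[Tuple[int, int]]:
    """Return (start, end) ranges (end exclusive) where profile > 0,
    merging ranges separated by fewer than min_gap empty positions."""
    spans: List[Tuple[int, int]] = []
    start = None
    for i, value in enumerate(profile):
        if value > 0 and start is None:
            start = i
        elif value == 0 and start is not None:
            spans.append((start, i))
            start = None
    if start is not None:
        spans.append((start, len(profile)))

    merged: List[Tuple[int, int]] = []
    for s, e in spans:
        if merged and s - merged[-1][1] < min_gap:
            merged[-1] = (merged[-1][0], e)  # gap too small: same word
        else:
            merged.append((s, e))
    return merged


def segment_words(img: Image, word_gap: int = 2) -> List[Tuple[int, int, int, int]]:
    """Return word boxes as (top, bottom, left, right); bottom/right exclusive."""
    boxes = []
    # Rows containing ink form text lines.
    row_profile = [sum(row) for row in img]
    for top, bottom in runs(row_profile):
        # Within one line, columns containing ink form words.
        col_profile = [
            sum(img[r][c] for r in range(top, bottom))
            for c in range(len(img[0]))
        ]
        for left, right in runs(col_profile, min_gap=word_gap):
            boxes.append((top, bottom, left, right))
    return boxes


# Toy page: two text lines; the first holds two words.
page = [
    [1, 1, 0, 1, 0, 0, 0, 1, 1],
    [1, 0, 0, 1, 0, 0, 0, 1, 0],
    [0, 0, 0, 0, 0, 0, 0, 0, 0],
    [0, 1, 1, 1, 0, 0, 0, 0, 0],
]
boxes = segment_words(page)
print(boxes)
```

A profile-based split like this fails when lines overlap or the page is warped, because no empty row or column separates the text, which is consistent with the limitations listed above.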
Stars: 31
Forks: 4
Language: Python
License: MIT
Category:
Last pushed: Dec 16, 2024
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/KadenMc/PreprocessingHTR"
Open to everyone: 100 requests/day, no key needed. Get a free key for 1,000/day.
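The same request can be sketched in Python with the standard library only. The URL layout is copied from the curl command above. The helper names `quality_url` and `fetch_quality` are hypothetical, and the sketch assumes the endpoint returns JSON, which the page does not state.

```python
import json
import urllib.request
from urllib.parse import quote

BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, repo: str) -> str:
    """Build the endpoint URL from a category and an 'owner/name' repo string."""
    owner, name = repo.split("/", 1)
    parts = (quote(p, safe="") for p in (category, owner, name))
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category: str, repo: str, timeout: float = 10.0):
    """Fetch the record for one repository.

    Assumption: the response body is JSON; adjust the parsing if it is not.
    """
    with urllib.request.urlopen(quality_url(category, repo), timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


url = quality_url("ml-frameworks", "KadenMc/PreprocessingHTR")
print(url)
```

Calling `fetch_quality("ml-frameworks", "KadenMc/PreprocessingHTR")` performs the network request; with no key, the 100 requests/day limit mentioned above applies.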
Higher-rated alternatives
JaidedAI/EasyOCR
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin,...
breezedeus/CnSTD
CnSTD: PyTorch/MXNet-based Chinese/English Scene Text Detection, Mathematical Formula...
githubharald/SimpleHTR
Handwritten Text Recognition (HTR) system implemented with TensorFlow.
felixdittrich92/OnnxTR
OnnxTR a docTR (Document Text Recognition) library Onnx pipeline wrapper - for seamless,...
mindee/doctr
docTR (Document Text Recognition) - a seamless, high-performing & accessible library for...