gnana70/tamil_ocr
OCR Tamil is a powerful tool that can detect and recognize text in Tamil images with high accuracy on Natural Scenes
This tool helps you quickly and accurately extract Tamil and English text from images taken in real-world settings, like photos of signboards, storefronts, or nameplates. You provide an image with text, and it gives you the extracted text. It's ideal for anyone needing to digitize text from natural scene photographs, such as those working with urban signage, public information, or visual data in Tamil-speaking regions.
No commits in the last 6 months.
Use this if you need to reliably convert text found in natural scene images (like signs on buildings) into digital text, especially for Tamil.
Not ideal if your primary need is to process text from scanned documents, PDFs, or other document-style images, as it lacks features like paragraph detection or skew correction for those formats.
Stars
84
Forks
15
Language
Python
License
MIT
Category
Last pushed
Sep 06, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/gnana70/tamil_ocr"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
deepdoctection/deepdoctection
A Repo For Document AI
deanmalmgren/textract
extract text from any document. no muss. no fuss.
eikek/docspell
Assist in organizing your piles of documents, resulting from scanners, e-mails and other sources...
zzzDavid/ICDAR-2019-SROIE
ICDAR 2019 Robust Reading Challenge on Scanned Receipts OCR and Information Extraction
clovaai/donut
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic...