amirabbasasadi/Shotor
Free Persian Word Level OCR Dataset
This dataset helps you develop systems that can recognize individual Persian words from images. You provide digital images of Persian words, and the system you build using this data will output the corresponding text. This is useful for researchers and developers creating optical character recognition (OCR) tools specifically for the Persian language.
No commits in the last 6 months.
Use this if you need a large, free collection of Persian word images and their text labels to train or test an OCR model.
Not ideal if you need to recognize entire documents, handwriting, or words in languages other than Persian.
Stars
24
Forks
4
Language
—
License
—
Category
Last pushed
Aug 01, 2020
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/amirabbasasadi/Shotor"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
amirshnll/Persian-Swear-Words
Persian Swear Dataset - you can use in your production to filter unwanted content. دیتاست کلمات...
sajjjadayobi/PersianQA
Persian (Farsi) Question Answering Dataset (+ Models)
aghasemi/ChronologicalPersianPoetryDataset
A chronological (up to the century in which the poet has lived) of Persian poetry, extracted...
miras-tech/MirasText
MirasText
farbodbj/persian-gender-by-name
A comprehensive dataset for determining gender based on Persian names, enriched with English...