FtmsdtHosseini/IDPL-PFOD

An Image Dataset of Printed Farsi Text for OCR Research

25
/ 100
Experimental

This dataset helps researchers and developers working on Optical Character Recognition (OCR) systems for the Farsi language. It provides over 30,000 artificial images of printed Farsi text, each containing a single line, with varying backgrounds and distortions. The output is a collection of image files and a corresponding CSV file detailing image information, useful for training and testing Farsi OCR models.

No commits in the last 6 months.

Use this if you are developing or evaluating OCR solutions specifically for printed Farsi text and need a diverse, structured image dataset for training or benchmarking.

Not ideal if you are looking for handwritten Farsi text, non-Farsi text, or a dataset for general image recognition tasks unrelated to text.

Farsi-language-processing text-recognition OCR-model-training document-digitization computer-vision-datasets
No License Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 7 / 25
Maturity 8 / 25
Community 10 / 25

How are scores calculated?

Stars

25

Forks

3

Language

License

Last pushed

Mar 15, 2022

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/diffusion/FtmsdtHosseini/IDPL-PFOD"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.