FtmsdtHosseini/IDPL-PFOD

An Image Dataset of Printed Farsi Text for OCR Research

/ 100

Experimental

This dataset helps researchers and developers working on Optical Character Recognition (OCR) systems for the Farsi language. It provides over 30,000 artificial images of printed Farsi text, each containing a single line, with varying backgrounds and distortions. The output is a collection of image files and a corresponding CSV file detailing image information, useful for training and testing Farsi OCR models.

No commits in the last 6 months.

Use this if you are developing or evaluating OCR solutions specifically for printed Farsi text and need a diverse, structured image dataset for training or benchmarking.

Not ideal if you are looking for handwritten Farsi text, non-Farsi text, or a dataset for general image recognition tasks unrelated to text.

Farsi-language-processing text-recognition OCR-model-training document-digitization computer-vision-datasets

No License Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 7 / 25

Maturity 8 / 25

Community 10 / 25

How are scores calculated?

Stars

Forks

Language

—

License

—

Higher-rated alternatives

Vchitect/VBench

[CVPR2024 Highlight] VBench - We Evaluate Video Generation

VectorSpaceLab/OmniGen

OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340

EndlessSora/focal-frequency-loss

[ICCV 2021] Focal Frequency Loss for Image Reconstruction and Synthesis

JIA-Lab-research/DreamOmni2

This project is the official implementation of 'DreamOmni2: Multimodal Instruction-based Editing...

SkyworkAI/UniPic

Open-source SOTA multi-image editing model

Explore Diffusion Models

All categories Trending Diffusion directory Insights