FtmsdtHosseini/IDPL-PFOD
An Image Dataset of Printed Farsi Text for OCR Research
This dataset supports researchers and developers building Optical Character Recognition (OCR) systems for the Farsi language. It provides over 30,000 synthetic images of printed Farsi text, each containing a single line, with varying backgrounds and distortions. The dataset consists of the image files and an accompanying CSV file describing each image, which makes it suitable for training and testing Farsi OCR models.
No commits in the last 6 months.
Use this if you are developing or evaluating OCR solutions specifically for printed Farsi text and need a diverse, structured image dataset for training or benchmarking.
Not ideal if you are looking for handwritten Farsi text, non-Farsi text, or a dataset for general image recognition tasks unrelated to text.
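As a rough illustration of how the image files and the CSV file might be paired for training and testing, here is a minimal Python sketch. The CSV layout is not documented on this page, so the column names `filename` and `text` and the helper names `load_labels` and `split_train_test` are assumptions made for illustration; adjust them to the real header row of the dataset's CSV.

```python
import csv
import random


def load_labels(csv_path, filename_col="filename", text_col="text"):
    """Map each image file name to its ground-truth line of text.

    The column names are hypothetical; check the real CSV header first.
    """
    labels = {}
    # utf-8-sig reads plain UTF-8 and also tolerates a leading byte-order mark.
    with open(csv_path, newline="", encoding="utf-8-sig") as f:
        for row in csv.DictReader(f):
            labels[row[filename_col]] = row[text_col]
    return labels


def split_train_test(labels, test_fraction=0.1, seed=0):
    """Deterministically shuffle the file names and split them into two dicts."""
    names = sorted(labels)
    random.Random(seed).shuffle(names)
    n_test = int(len(names) * test_fraction)
    test_names, train_names = names[:n_test], names[n_test:]
    train = {name: labels[name] for name in train_names}
    test = {name: labels[name] for name in test_names}
    return train, test
```

Sorting the names before shuffling with a fixed seed keeps the split reproducible across runs, which is useful when the dataset is used for benchmarking.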
Stars: 25
Forks: 3
Language: —
License: —
Category: —
Last pushed: Mar 15, 2022
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/diffusion/FtmsdtHosseini/IDPL-PFOD"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
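The same request can be made from Python using only the standard library. This is a sketch under stated assumptions: the page does not document the response format, so treating the body as JSON is a guess (the code falls back to raw text), and the helper names `build_quality_url` and `fetch_quality` are hypothetical.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl command above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality/diffusion"


def build_quality_url(owner, repo):
    """Build the endpoint URL for a repository given as owner and name."""
    return f"{BASE_URL}/{quote(owner, safe='')}/{quote(repo, safe='')}"


def fetch_quality(owner, repo, timeout=10.0):
    """Fetch the record for one repository.

    Assumption: the endpoint returns JSON. If the body does not parse as
    JSON, the raw text is returned instead.
    """
    url = build_quality_url(owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        body = resp.read().decode("utf-8")
    try:
        return json.loads(body)
    except json.JSONDecodeError:
        return body
```

Calling `fetch_quality("FtmsdtHosseini", "IDPL-PFOD")` issues one request, which counts against the daily limit stated above.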
Higher-rated alternatives
Vchitect/VBench
[CVPR2024 Highlight] VBench - We Evaluate Video Generation
VectorSpaceLab/OmniGen
OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340
EndlessSora/focal-frequency-loss
[ICCV 2021] Focal Frequency Loss for Image Reconstruction and Synthesis
JIA-Lab-research/DreamOmni2
This project is the official implementation of 'DreamOmni2: Multimodal Instruction-based Editing...
SkyworkAI/UniPic
Open-source SOTA multi-image editing model