sajjjadayobi/PersianQA
Persian (Farsi) Question Answering Dataset (+ Models)
This project provides a comprehensive dataset for developing and testing automated question-answering systems in Persian (Farsi). It pairs passages of Persian text, drawn primarily from Wikipedia, with questions that are either answerable directly from the passage or marked as unanswerable. Researchers, linguists, and AI developers working on natural language processing for Persian can use it to train and evaluate models that understand and answer queries in Persian.
213 stars. No commits in the last 6 months.
Use this if you need a high-quality, human-annotated dataset to build or benchmark a model that answers questions over Persian text, including detecting when a question cannot be answered from the passage.
Not ideal if you are looking for an off-the-shelf system to answer questions immediately, or if your domain requires very specialized knowledge beyond general Wikipedia content.
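To make the answerable/unanswerable distinction concrete, here is a minimal sketch of how one passage-question pair might be represented and scored. The `QAExample` class, its field names, and the `exact_match` helper are hypothetical and are not taken from the dataset's actual schema; the two Persian sentences are invented toy examples, not records from the dataset. The sketch assumes an empty prediction means "no answer in the passage".

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class QAExample:
    """One passage/question pair (hypothetical layout, not the dataset's schema)."""
    context: str
    question: str
    answers: List[str] = field(default_factory=list)  # empty list => unanswerable

    @property
    def is_answerable(self) -> bool:
        return len(self.answers) > 0


def exact_match(prediction: str, example: QAExample) -> bool:
    """Return True if the prediction matches a gold answer exactly.

    Assumption: an empty prediction means the model declined to answer,
    which is the correct behaviour for an unanswerable question.
    """
    pred = prediction.strip()
    if not example.is_answerable:
        return pred == ""
    return any(pred == gold.strip() for gold in example.answers)


# Invented toy examples for illustration only.
examples = [
    QAExample(
        context="تهران پایتخت ایران است.",   # "Tehran is the capital of Iran."
        question="پایتخت ایران کجاست؟",      # "Where is the capital of Iran?"
        answers=["تهران"],                   # "Tehran"
    ),
    QAExample(
        context="تهران پایتخت ایران است.",
        question="جمعیت تهران چقدر است؟",    # "What is the population of Tehran?"
    ),
]

predictions = ["تهران", ""]  # second prediction abstains
score = sum(exact_match(p, e) for p, e in zip(predictions, examples)) / len(examples)
```

A model that always produces some answer would be penalised on the second example, which is why a benchmark with unanswerable questions is harder than one without them.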
Stars: 213
Forks: 17
Language: Jupyter Notebook
License: GPL-3.0
Category:
Last pushed: Sep 08, 2021
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/sajjjadayobi/PersianQA"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
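The same request can be made from Python using only the standard library. This is a rough sketch: the URL is copied verbatim from the curl command above, while the function name `fetch_quality_report`, the `User-Agent` value, and the timeout are illustrative choices. The response format is not documented here, so the function returns the raw body as text rather than assuming JSON.

```python
import urllib.request

# URL copied verbatim from the curl command above.
QUALITY_URL = "https://pt-edge.onrender.com/api/v1/quality/nlp/sajjjadayobi/PersianQA"


def fetch_quality_report(url: str = QUALITY_URL, timeout: float = 10.0) -> str:
    """Send an unauthenticated GET request and return the raw response body.

    No API key is sent, so the keyless daily request limit applies.
    """
    request = urllib.request.Request(
        url,
        headers={"User-Agent": "python-urllib-example"},  # illustrative value
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        # Fall back to UTF-8 if the server does not declare a charset.
        charset = response.headers.get_content_charset() or "utf-8"
        return response.read().decode(charset)
```

Calling `fetch_quality_report()` performs one request against the daily quota, so it is worth caching the returned text locally when experimenting.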
Higher-rated alternatives
amirshnll/Persian-Swear-Words
Persian Swear Dataset - you can use it in production to filter unwanted content. Dataset of words...
aghasemi/ChronologicalPersianPoetryDataset
A chronological dataset of Persian poetry (dated to the century in which each poet lived), extracted...
miras-tech/MirasText
MirasText
farbodbj/persian-gender-by-name
A comprehensive dataset for determining gender based on Persian names, enriched with English...
dml-qom/FarsTail
FarsTail: a Persian natural language inference dataset