bahaeddinmselmi/tunisian-arabic-ai-dataset
The largest open-source dataset for Tunisian Arabic (Derja) NLP, featuring social media text, transcripts, and e-commerce data for LLM training and fine-tuning.
14
/ 100
Experimental
No License
No Package
No Dependents
Maintenance
10 / 25
Adoption
1 / 25
Maturity
3 / 25
Community
0 / 25
Stars
1
Forks
—
Language
—
License
—
Category
Last pushed
Jan 28, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/bahaeddinmselmi/tunisian-arabic-ai-dataset"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
CAMeL-Lab/camel_tools
A suite of Arabic natural language processing tools developed by the CAMeL Lab at New York...
67
PetrKorab/Arabica
Python package for text mining of time-series data
54
markuskiller/textblob-de
German language support for TextBlob.
48
MagedSaeed/farasapy
A Python implementation of Farasa toolkit
46
adhaamehab/textblob-ar
Arabic support for textblob
45