AnwarCS/Sudanese-Arabic-LLM

Building a Sudanese Arabic dataset and fine-tuning LLMs to improve representation of this dialect.

42
/ 100
Emerging

This project helps improve how large language models understand and generate Sudanese Arabic. It takes raw text from various sources like social media and oral stories, processes it, and then uses it to train AI models. The output is an AI that better recognizes and produces Sudanese Arabic. Language researchers, AI developers, and cultural preservationists focused on Sudanese Arabic would find this useful.

No commits in the last 6 months.

Use this if you need AI models that accurately process and generate text specifically in the Sudanese Arabic dialect.

Not ideal if your focus is on Modern Standard Arabic or other Arabic dialects, as this project is highly specialized.

Sudanese culture language preservation dialect representation AI language development linguistic research
Stale 6m No Package No Dependents
Maintenance 2 / 25
Adoption 6 / 25
Maturity 15 / 25
Community 19 / 25

How are scores calculated?

Stars

22

Forks

17

Language

Python

License

MIT

Category

arabic-nlp-tools

Last pushed

Jun 13, 2025

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/AnwarCS/Sudanese-Arabic-LLM"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.