yousefkotp/Egyptian-Arabic-ASR-and-Diarization

The official submission from the Speech Squad team for the 2024 MTC-AIC 2 competition: an ASR model tailored to the Egyptian dialect and built on the FastConformer architecture. Our four-stage training pipeline achieved a Mean Levenshtein Distance score of 9.58 on the test set.
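For readers unfamiliar with the metric, here is a minimal sketch of how a mean Levenshtein distance over a set of transcripts could be computed. It assumes character-level edit distance averaged over utterances; the competition's exact scoring (character or word level, any text normalization) is not stated here, and the function names `levenshtein` and `mean_levenshtein` are illustrative, not taken from the repository.

```python
from typing import Sequence


def levenshtein(a: str, b: str) -> int:
    """Edit distance (insertions, deletions, substitutions) between two strings."""
    # Keep the shorter string as the inner dimension so each row stays small.
    if len(a) < len(b):
        a, b = b, a
    previous = list(range(len(b) + 1))  # distances from the empty prefix of a
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            substitution_cost = 0 if char_a == char_b else 1
            current.append(
                min(
                    previous[j] + 1,                      # deletion
                    current[j - 1] + 1,                   # insertion
                    previous[j - 1] + substitution_cost,  # substitution or match
                )
            )
        previous = current
    return previous[-1]


def mean_levenshtein(references: Sequence[str], hypotheses: Sequence[str]) -> float:
    """Average edit distance over paired reference and predicted transcripts."""
    if len(references) != len(hypotheses):
        raise ValueError("references and hypotheses must have the same length")
    if not references:
        raise ValueError("at least one transcript pair is required")
    total = sum(levenshtein(r, h) for r, h in zip(references, hypotheses))
    return total / len(references)
```

Lower is better under this metric: a score of 0 means every predicted transcript matches its reference exactly.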

37 / 100 (Emerging)

This project transcribes spoken Egyptian Arabic from audio files into text. It takes an audio recording in Egyptian Arabic and outputs the corresponding text, with an option to also identify different speakers within the recording. This tool is ideal for anyone needing to convert spoken Egyptian Arabic into written form for analysis, documentation, or accessibility.

Use this if you need to accurately convert audio recordings of conversations, interviews, or broadcasts in the Egyptian Arabic dialect into written text.

Not ideal if your audio contains dialects other than Egyptian Arabic, or if you need to transcribe languages other than Arabic.

Egyptian Arabic transcription, speech-to-text, speaker diarization, audio analysis, linguistics
No Package, No Dependents
Maintenance 10 / 25
Adoption 6 / 25
Maturity 16 / 25
Community 5 / 25


Stars: 17
Forks: 1
Language: Jupyter Notebook
License: MIT
Last pushed: Mar 09, 2026
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/yousefkotp/Egyptian-Arabic-ASR-and-Diarization"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
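The same request can be made from Python. The sketch below uses only the standard library and rests on a few assumptions: the path layout (category, owner, repository) is inferred from the single URL shown above, the response is returned as raw UTF-8 text because its format is not documented here, and the helper names `quality_url` and `fetch_quality` are hypothetical. How an API key would be supplied is not stated, so the sketch covers keyless access only.

```python
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL; the segment order is an assumption from the curl example."""
    segments = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(segments)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0) -> str:
    """Fetch the quality record and return the raw response body as text."""
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        # Decoding as UTF-8 is an assumption; the response format is not documented here.
        return response.read().decode("utf-8")


if __name__ == "__main__":
    print(fetch_quality("voice-ai", "yousefkotp", "Egyptian-Arabic-ASR-and-Diarization"))
```

With the keyless limit of 100 requests/day, it is worth caching the response locally rather than fetching on every run.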