yousefkotp/Egyptian-Arabic-ASR-and-Diarization
The official submission from Speech Squad team for the MTC-AIC 2 competition of 2024 where an ASR model is developed tailored for the Egyptian dialect, utilizing the FastConformer architecture. Our four-stage training pipeline achieved a Mean Levenshtein Distance score of 9.58 on the test set.
This project transcribes spoken Egyptian Arabic from audio files into text. It takes an audio recording in Egyptian Arabic and outputs the corresponding text, with an option to also identify different speakers within the recording. This tool is ideal for anyone needing to convert spoken Egyptian Arabic into written form for analysis, documentation, or accessibility.
Use this if you need to accurately convert audio recordings of conversations, interviews, or broadcasts in the Egyptian Arabic dialect into written text.
Not ideal if your audio contains dialects other than Egyptian Arabic, or if you need to transcribe languages other than Arabic.
Stars
17
Forks
1
Language
Jupyter Notebook
License
MIT
Category
Last pushed
Mar 09, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/yousefkotp/Egyptian-Arabic-ASR-and-Diarization"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
pnnbao97/VieNeu-TTS
Vietnamese TTS with instant voice cloning • On-device • Real-time CPU inference • 24kHz audio...
CorentinJ/Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
babysor/MockingBird
🚀Clone a voice in 5 seconds to generate arbitrary speech in real-time
r9y9/nnmnkwii
Library to build speech synthesis systems designed for easy and fast prototyping.
Softcatala/open-dubbing
Open dubbing is an AI dubbing system which uses machine learning models to automatically...