yousefkotp/Egyptian-Arabic-ASR-and-Diarization

The official submission from Speech Squad team for the MTC-AIC 2 competition of 2024 where an ASR model is developed tailored for the Egyptian dialect, utilizing the FastConformer architecture. Our four-stage training pipeline achieved a Mean Levenshtein Distance score of 9.58 on the test set.

/ 100

Emerging

This project transcribes spoken Egyptian Arabic from audio files into text. It takes an audio recording in Egyptian Arabic and outputs the corresponding text, with an option to also identify different speakers within the recording. This tool is ideal for anyone needing to convert spoken Egyptian Arabic into written form for analysis, documentation, or accessibility.

Use this if you need to accurately convert audio recordings of conversations, interviews, or broadcasts in the Egyptian Arabic dialect into written text.

Not ideal if your audio contains dialects other than Egyptian Arabic, or if you need to transcribe languages other than Arabic.

Egyptian Arabic transcription speech-to-text speaker diarization audio analysis linguistics

No Package No Dependents

Maintenance 10 / 25

Adoption 6 / 25

Maturity 16 / 25

Community 5 / 25

How are scores calculated?

Stars

Forks

Language

Jupyter Notebook

License

MIT

Higher-rated alternatives

pnnbao97/VieNeu-TTS

Vietnamese TTS with instant voice cloning • On-device • Real-time CPU inference • 24kHz audio...

CorentinJ/Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

babysor/MockingBird

🚀Clone a voice in 5 seconds to generate arbitrary speech in real-time

r9y9/nnmnkwii

Library to build speech synthesis systems designed for easy and fast prototyping.

Softcatala/open-dubbing

Open dubbing is an AI dubbing system which uses machine learning models to automatically...

Explore Voice AI Tools

All categories Trending Voice AI directory Insights