Lhx94As/Awesome-Spoken-Language-Identification
An awesome spoken LID repository. (Working in progress
This resource helps researchers and practitioners in speech technology identify the language spoken in an audio clip. It compiles and organizes academic papers, datasets, and toolkits for spoken language identification (LID) into a single, easy-to-navigate location. If you are developing systems that need to automatically understand what language is being spoken, this repository is for you.
108 stars. No commits in the last 6 months.
Use this if you are a speech technology researcher or engineer working on spoken language identification and need a comprehensive overview of the latest research, datasets, and tools in the field.
Not ideal if you are looking for a ready-to-use, off-the-shelf solution for general language translation or transcription without needing to understand the underlying research and development.
Stars
108
Forks
9
Language
Python
License
—
Category
Last pushed
Apr 22, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/Lhx94As/Awesome-Spoken-Language-Identification"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
julius-speech/julius
Open-Source Large Vocabulary Continuous Speech Recognition Engine
rolczynski/Automatic-Speech-Recognition
🎧 Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)
tabahi/formantfeatures
Extract frequency, power, width and dissonance of formants from wav files
libdriver/ld3320
LD3320 full-featured driver library for general-purpose MCU and Linux.
awsaf49/audio_classification_models
Tensorflow Audio Classification Models