shitian-ni/speech-recognition-transfer-learning
Speech command recognition with DenseNet, using transfer learning from UrbanSound8K, in Keras/TensorFlow
This project helps you quickly build a system to recognize specific spoken commands, like "yes," "no," or "stop." It adapts a DenseNet model trained on recordings of everyday urban sounds (UrbanSound8K) to classify short voice commands. It may be useful to anyone working on voice-controlled devices, accessibility tools, or interactive voice response systems.
No commits in the last 6 months.
Use this if you need to train a speech command recognition system using a relatively small dataset of voice commands, leveraging a pre-existing, larger dataset of general urban sounds.
Not ideal if you're building a system for transcribing full sentences or need to recognize speech in a language not represented in the available transfer learning datasets.
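The transfer-learning idea described above can be sketched in Keras roughly as follows. This is not the repository's actual code: the input shape (a 64x64 single-channel spectrogram patch), the helper names `build_backbone` and `attach_head`, and the choice of `DenseNet121` are assumptions made for illustration. The three-command list comes from the examples in the description, and the 10-class source head matches the 10 sound classes in UrbanSound8K.

```python
import tensorflow as tf

# Target commands taken from the examples in the description.
COMMANDS = ["yes", "no", "stop"]
# Assumed input: a 64x64 single-channel spectrogram patch (hypothetical size).
INPUT_SHAPE = (64, 64, 1)


def build_backbone(input_shape=INPUT_SHAPE):
    """DenseNet121 feature extractor with randomly initialized weights."""
    return tf.keras.applications.DenseNet121(
        include_top=False,   # drop the ImageNet classifier head
        weights=None,        # train from scratch on audio spectrograms
        input_shape=input_shape,
        pooling="avg",       # global average pooling -> one feature vector
    )


def attach_head(backbone, num_classes, name):
    """Put a softmax classifier on top of the shared backbone."""
    outputs = tf.keras.layers.Dense(
        num_classes, activation="softmax", name=name
    )(backbone.output)
    return tf.keras.Model(backbone.input, outputs)


backbone = build_backbone()

# Step 1: source task. UrbanSound8K has 10 sound classes.
source_model = attach_head(backbone, 10, "urbansound_head")
source_model.compile(optimizer="adam", loss="sparse_categorical_crossentropy")
# source_model.fit(urban_x, urban_y, ...)  # hypothetical arrays, not provided here

# Step 2: target task. Freeze the pretrained features and train a new head.
backbone.trainable = False
target_model = attach_head(backbone, len(COMMANDS), "command_head")
target_model.compile(
    optimizer="adam",
    loss="sparse_categorical_crossentropy",
    metrics=["accuracy"],
)
# target_model.fit(command_x, command_y, ...)  # hypothetical arrays, not provided here
```

Freezing the backbone keeps the small command dataset from overwriting the features learned on the larger one. A common variation is to unfreeze the last few layers afterwards and continue training with a low learning rate.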
Stars: 17
Forks: 5
Language: Python
License: none listed
Category
Last pushed: Jan 19, 2018
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/shitian-ni/speech-recognition-transfer-learning"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
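The same request can be made from Python using only the standard library. This is a minimal sketch: the helper names `quality_url` and `fetch_quality` are hypothetical, the category/owner/repo path layout is inferred from the single URL shown above, and the response is returned as raw text because its format is not documented here. How an API key would be passed is also not stated, so the sketch uses the keyless tier.

```python
from urllib.parse import quote
from urllib.request import urlopen

# Base path copied from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category, owner, repo):
    """Build the endpoint URL (path layout assumed from the curl example)."""
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category, owner, repo, timeout=10):
    """Fetch the raw response body as text; no API key is sent."""
    with urlopen(quality_url(category, owner, repo), timeout=timeout) as response:
        return response.read().decode("utf-8")


url = quality_url("voice-ai", "shitian-ni", "speech-recognition-transfer-learning")
print(url)
# body = fetch_quality("voice-ai", "shitian-ni", "speech-recognition-transfer-learning")
```

The network call is left commented out so the snippet runs offline and does not count against the daily request limit.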
Higher-rated alternatives
julius-speech/julius
Open-Source Large Vocabulary Continuous Speech Recognition Engine
rolczynski/Automatic-Speech-Recognition
🎧 Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)
tabahi/formantfeatures
Extract frequency, power, width and dissonance of formants from wav files
libdriver/ld3320
LD3320 full-featured driver library for general-purpose MCU and Linux.
awsaf49/audio_classification_models
Tensorflow Audio Classification Models