codersinthestorm/RecurrentNN_SpeechRecognition
A model based in Tensorflow to recognize words from the 30 word Speech Commands Dataset from Google using LSTM based Recurrent Neural Network.
This project helps integrate spoken commands into devices or applications by recognizing specific keywords. You provide short audio recordings of spoken words, and it identifies which of the 30 predefined words were spoken. This is ideal for product managers or engineers building voice interfaces for simple control tasks.
No commits in the last 6 months.
Use this if you need to add basic voice control to a system, such as responding to 'on', 'off', 'stop', or 'go'.
Not ideal if you need to recognize a wide vocabulary, continuous speech, or different languages beyond the specific 30 English commands.
Stars
11
Forks
3
Language
Python
License
—
Category
Last pushed
Dec 05, 2017
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/codersinthestorm/RecurrentNN_SpeechRecognition"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
julius-speech/julius
Open-Source Large Vocabulary Continuous Speech Recognition Engine
rolczynski/Automatic-Speech-Recognition
🎧 Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)
tabahi/formantfeatures
Extract frequency, power, width and dissonance of formants from wav files
libdriver/ld3320
LD3320 full-featured driver library for general-purpose MCU and Linux.
awsaf49/audio_classification_models
Tensorflow Audio Classification Models