JaesungBae/Speech-Command-Recognition-with-Capsule-Network
Speech command recognition with capsule network & various NNs / KWS on Google Speech Command Dataset.
This project helps you build and evaluate systems that recognize short spoken commands, even in noisy environments. You supply raw audio files, and it trains a model that predicts which command each clip contains. It is useful for product developers and researchers building voice-controlled interfaces for smart devices, assistive technologies, or other applications that rely on simple voice commands.
No commits in the last 6 months.
Use this if you are developing a keyword spotting system and need to accurately identify specific spoken words from short audio clips, especially when background noise might be an issue.
Not ideal if you need to transcribe continuous speech, understand complex natural language, or develop a system for languages other than English.
Stars: 25
Forks: 6
Language: Python
License: —
Category:
Last pushed: Jan 28, 2019
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/JaesungBae/Speech-Command-Recognition-with-Capsule-Network"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
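The same request can be sketched in Python using only the standard library. This is a minimal sketch, not a documented client: the function names `build_quality_url` and `fetch_quality` are hypothetical, the reading of the path as category/owner/repository is inferred from the single curl example above, and the assumption that the endpoint returns JSON is not confirmed by this page.

```python
import json
import urllib.parse
import urllib.request

# Base path copied from the curl example; everything after it is assumed
# to follow the pattern <category>/<owner>/<repo>.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Build the request URL, percent-encoding each path segment."""
    parts = [urllib.parse.quote(p, safe="") for p in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the record for one repository.

    Assumes the response body is JSON; adjust the decoding step if the
    endpoint returns something else.
    """
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


# Example call (performs a network request, so it is left commented out):
# data = fetch_quality(
#     "voice-ai",
#     "JaesungBae",
#     "Speech-Command-Recognition-with-Capsule-Network",
# )
# print(data)
```

Without a key, the stated limit is 100 requests per day, so it is worth caching results locally if you query many repositories.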
Higher-rated alternatives
julius-speech/julius: Open-Source Large Vocabulary Continuous Speech Recognition Engine
rolczynski/Automatic-Speech-Recognition: 🎧 Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)
tabahi/formantfeatures: Extract frequency, power, width and dissonance of formants from wav files
libdriver/ld3320: LD3320 full-featured driver library for general-purpose MCU and Linux.
awsaf49/audio_classification_models: Tensorflow Audio Classification Models