JaesungBae/Speech-Command-Recognition-with-Capsule-Network

Speech command recognition with capsule network & various NNs / KWS on Google Speech Command Dataset.

/ 100

Emerging

This project helps you build and evaluate systems that can recognize short spoken commands, even in noisy environments. You input raw audio files, and it trains a model that outputs identified speech commands. This is useful for product developers or researchers building voice-controlled interfaces for smart devices, assistive technologies, or other applications where simple voice commands are needed.

No commits in the last 6 months.

Use this if you are developing a keyword spotting system and need to accurately identify specific spoken words from short audio clips, especially when background noise might be an issue.

Not ideal if you need to transcribe continuous speech, understand complex natural language, or develop a system for languages other than English.

voice-control keyword-spotting speech-recognition embedded-systems human-computer-interaction

No License Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 7 / 25

Maturity 8 / 25

Community 16 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

—

Higher-rated alternatives

julius-speech/julius

Open-Source Large Vocabulary Continuous Speech Recognition Engine

rolczynski/Automatic-Speech-Recognition

🎧 Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)

tabahi/formantfeatures

Extract frequency, power, width and dissonance of formants from wav files

libdriver/ld3320

LD3320 full-featured driver library for general-purpose MCU and Linux.

awsaf49/audio_classification_models

Tensorflow Audio Classification Models

Explore Voice AI Tools

All categories Trending Voice AI directory Insights