felixchenfy/Speech-Commands-Classification-by-LSTM-PyTorch

Classification of 11 types of audio clips using MFCCs features and LSTM. Pretrained on Speech Command Dataset with intensive data augmentation.

/ 100

Emerging

This project helps you classify short audio clips into one of eleven specific commands, such as "one," "two," "stop," or "none." It takes a recorded audio snippet as input and tells you which command it detected. Anyone building simple voice control systems or needing to identify specific spoken words could use this.

No commits in the last 6 months.

Use this if you need to build a system that recognizes a limited set of spoken commands from short audio recordings.

Not ideal if you need to recognize a wide vocabulary of words or transcribe continuous speech into text.

voice-control speech-recognition audio-classification command-systems

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 8 / 25

Maturity 16 / 25

Community 19 / 25

How are scores calculated?

Stars

Forks

Language

Jupyter Notebook

License

MIT

Higher-rated alternatives

julius-speech/julius

Open-Source Large Vocabulary Continuous Speech Recognition Engine

rolczynski/Automatic-Speech-Recognition

🎧 Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)

tabahi/formantfeatures

Extract frequency, power, width and dissonance of formants from wav files

libdriver/ld3320

LD3320 full-featured driver library for general-purpose MCU and Linux.

awsaf49/audio_classification_models

Tensorflow Audio Classification Models

Explore Voice AI Tools

All categories Trending Voice AI directory Insights