felixchenfy/Speech-Commands-Classification-by-LSTM-PyTorch
Classification of 11 types of audio clips using MFCCs features and LSTM. Pretrained on Speech Command Dataset with intensive data augmentation.
This project helps you classify short audio clips into one of eleven specific commands, such as "one," "two," "stop," or "none." It takes a recorded audio snippet as input and tells you which command it detected. Anyone building simple voice control systems or needing to identify specific spoken words could use this.
No commits in the last 6 months.
Use this if you need to build a system that recognizes a limited set of spoken commands from short audio recordings.
Not ideal if you need to recognize a wide vocabulary of words or transcribe continuous speech into text.
Stars
43
Forks
18
Language
Jupyter Notebook
License
MIT
Category
Last pushed
Dec 14, 2022
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/felixchenfy/Speech-Commands-Classification-by-LSTM-PyTorch"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
julius-speech/julius
Open-Source Large Vocabulary Continuous Speech Recognition Engine
rolczynski/Automatic-Speech-Recognition
🎧 Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)
tabahi/formantfeatures
Extract frequency, power, width and dissonance of formants from wav files
libdriver/ld3320
LD3320 full-featured driver library for general-purpose MCU and Linux.
awsaf49/audio_classification_models
Tensorflow Audio Classification Models