subho406/TF-Speech-Recognition-Challenge-Solution
Source code of the model used in Tensorflow Speech Recognition Challenge (https://www.kaggle.com/c/tensorflow-speech-recognition-challenge). The solution ranked in top 5% in private leaderboard.
This project provides a pre-built solution for recognizing simple spoken commands from audio recordings. You feed it audio files containing single words, and it tells you which command was spoken. This is useful for anyone developing or prototyping systems that need to understand basic voice commands, such as smart home devices, accessibility tools, or interactive voice response systems.
No commits in the last 6 months.
Use this if you need a high-performing baseline or a direct solution for classifying single-word audio commands like 'yes', 'no', 'up', or 'down'.
Not ideal if you need to recognize continuous speech, complex sentences, or commands not present in the Speech Commands Dataset.
Stars
58
Forks
28
Language
Jupyter Notebook
License
GPL-3.0
Category
Last pushed
Mar 30, 2018
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/subho406/TF-Speech-Recognition-Challenge-Solution"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
julius-speech/julius
Open-Source Large Vocabulary Continuous Speech Recognition Engine
rolczynski/Automatic-Speech-Recognition
🎧 Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)
tabahi/formantfeatures
Extract frequency, power, width and dissonance of formants from wav files
libdriver/ld3320
LD3320 full-featured driver library for general-purpose MCU and Linux.
awsaf49/audio_classification_models
Tensorflow Audio Classification Models