creafz/kaggle-speech-recognition

Solution for TensorFlow Speech Recognition Challenge on Kaggle (125th place, top 10%)

/ 100

Emerging

This project helps classify short, one-second spoken commands into predefined categories like "yes," "no," or "stop." It takes audio files of spoken words and outputs a label for each word, determining what was said. This is useful for engineers developing voice control features or conversational AI systems.

No commits in the last 6 months.

Use this if you need a baseline system for accurately identifying specific spoken commands from short audio clips.

Not ideal if you need to transcribe long-form speech, recognize a very large vocabulary, or work with languages other than English.

voice-user-interface speech-recognition audio-classification command-and-control conversational-ai

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 5 / 25

Maturity 16 / 25

Community 12 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

MIT

Higher-rated alternatives

julius-speech/julius

Open-Source Large Vocabulary Continuous Speech Recognition Engine

rolczynski/Automatic-Speech-Recognition

🎧 Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)

tabahi/formantfeatures

Extract frequency, power, width and dissonance of formants from wav files

libdriver/ld3320

LD3320 full-featured driver library for general-purpose MCU and Linux.

awsaf49/audio_classification_models

Tensorflow Audio Classification Models

Explore Voice AI Tools

All categories Trending Voice AI directory Insights