sk-g/Speech-Recognition-Tensorflow-Challenge
Different CNN Models for keyword spotting in speech recognition
This project helps audio engineers and researchers automatically identify spoken keywords within audio recordings. It takes raw audio clips, converts them into visual spectrograms, and then processes these images to pinpoint specific words, even in noisy environments. The primary user would be someone involved in audio analysis or developing voice-controlled applications.
No commits in the last 6 months.
Use this if you need to build or evaluate a system for recognizing a limited set of spoken keywords from audio files, especially if you're working with spectrogram images.
Not ideal if you're looking for a general-purpose transcription service for arbitrary speech or if you need to process live audio streams.
Stars
10
Forks
1
Language
Python
License
GPL-3.0
Category
Last pushed
Jul 11, 2018
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/sk-g/Speech-Recognition-Tensorflow-Challenge"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
Picovoice/porcupine
On-device wake word detection powered by deep learning
MycroftAI/mycroft-precise
A lightweight, simple-to-use, RNN wake word listener
arcosoph/nanowakeword
A lightweight, open-source, and intelligent wake word detection engine. Train custom,...
mozilla/DeepSpeech
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run...
OAID/cortex-m-kws
Cortex M KWS example with Tengine Lite.