ryuuji06/keyword-spotting
In this repository, I implement a system for detecting specific spoken words in speech signal. When reading a speech signal, I detect not only the presence, but also the time position of the keyword. For this purpose, I use a CNN-RNN network, with a CTC (Connectionist Temporal Classification) loss function.
This project helps you automatically find specific spoken words in audio recordings or real-time speech. You provide audio files or live microphone input, and it tells you not only if a keyword was spoken, but also exactly when it occurred. It's designed for anyone who needs to detect predefined voice commands or specific words in audio streams.
No commits in the last 6 months.
Use this if you need to reliably detect and pinpoint specific keywords in spoken language, whether from existing audio files or a live microphone.
Not ideal if you need to transcribe entire conversations or identify a wide range of arbitrary words, as it's optimized for a small, predefined set of keywords.
Stars
19
Forks
5
Language
Python
License
—
Category
Last pushed
Sep 27, 2021
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/ryuuji06/keyword-spotting"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
Picovoice/porcupine
On-device wake word detection powered by deep learning
MycroftAI/mycroft-precise
A lightweight, simple-to-use, RNN wake word listener
arcosoph/nanowakeword
A lightweight, open-source, and intelligent wake word detection engine. Train custom,...
mozilla/DeepSpeech
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run...
OAID/cortex-m-kws
Cortex M KWS example with Tengine Lite.