ryuuji06/keyword-spotting

In this repository, I implement a system for detecting specific spoken words in speech signal. When reading a speech signal, I detect not only the presence, but also the time position of the keyword. For this purpose, I use a CNN-RNN network, with a CTC (Connectionist Temporal Classification) loss function.

/ 100

Experimental

This project helps you automatically find specific spoken words in audio recordings or real-time speech. You provide audio files or live microphone input, and it tells you not only if a keyword was spoken, but also exactly when it occurred. It's designed for anyone who needs to detect predefined voice commands or specific words in audio streams.

No commits in the last 6 months.

Use this if you need to reliably detect and pinpoint specific keywords in spoken language, whether from existing audio files or a live microphone.

Not ideal if you need to transcribe entire conversations or identify a wide range of arbitrary words, as it's optimized for a small, predefined set of keywords.

voice-control audio-analysis speech-detection command-recognition sound-monitoring

No License Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 6 / 25

Maturity 8 / 25

Community 15 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

—

Higher-rated alternatives

Picovoice/porcupine

On-device wake word detection powered by deep learning

MycroftAI/mycroft-precise

A lightweight, simple-to-use, RNN wake word listener

arcosoph/nanowakeword

A lightweight, open-source, and intelligent wake word detection engine. Train custom,...

mozilla/DeepSpeech

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run...

OAID/cortex-m-kws

Cortex M KWS example with Tengine Lite.

Explore Voice AI Tools

All categories Trending Voice AI directory Insights