ryuuji06/keyword-spotting

In this repository, I implement a system for detecting specific spoken words in a speech signal. When reading a speech signal, I detect not only the presence but also the time position of the keyword. For this purpose, I use a CNN-RNN network with a CTC (Connectionist Temporal Classification) loss function.
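To illustrate how a CTC-trained network can yield both the presence and the time position of a keyword, here is a minimal sketch of greedy CTC decoding. It is not taken from the repository: the function name `ctc_greedy_decode`, the blank index `0`, and the `frame_seconds` parameter are assumptions made for this example, and the per-frame probabilities stand in for the output of the CNN-RNN.

```python
from typing import List, Sequence, Tuple

BLANK = 0  # assumed index of the CTC blank symbol (hypothetical choice)


def ctc_greedy_decode(
    frame_probs: Sequence[Sequence[float]],
    frame_seconds: float,
) -> List[Tuple[int, float]]:
    """Return (label, onset_time_in_seconds) pairs from per-frame class probabilities.

    Greedy CTC decoding: take the most likely class in each frame, merge
    consecutive repeats, and drop blanks. The frame where a label first
    appears is used as a rough estimate of its time position.
    """
    events: List[Tuple[int, float]] = []
    previous = BLANK
    for t, probs in enumerate(frame_probs):
        # Index of the most probable class in this frame.
        label = max(range(len(probs)), key=lambda k: probs[k])
        # Emit only when a non-blank label starts (not a repeat of the last frame).
        if label != BLANK and label != previous:
            events.append((label, t * frame_seconds))
        previous = label
    return events


# Example input: 6 frames, 3 classes (blank, keyword 1, keyword 2).
example_probs = [
    [0.90, 0.05, 0.05],  # blank
    [0.10, 0.80, 0.10],  # keyword 1 starts
    [0.20, 0.70, 0.10],  # keyword 1 continues (merged)
    [0.90, 0.05, 0.05],  # blank
    [0.10, 0.10, 0.80],  # keyword 2 starts
    [0.10, 0.80, 0.10],  # keyword 1 starts again
]
detections = ctc_greedy_decode(example_probs, frame_seconds=0.25)
```

Each detection pairs a keyword label with the time of the frame in which it first appears, so the time resolution is bounded by the frame spacing of the network output.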

29 / 100 (Experimental)

This project helps you automatically find specific spoken words in audio recordings or real-time speech. You provide audio files or live microphone input, and it tells you not only if a keyword was spoken, but also exactly when it occurred. It's designed for anyone who needs to detect predefined voice commands or specific words in audio streams.

No commits in the last 6 months.

Use this if you need to reliably detect and pinpoint specific keywords in spoken language, whether from existing audio files or a live microphone.

Not ideal if you need to transcribe entire conversations or identify a wide range of arbitrary words, as it's optimized for a small, predefined set of keywords.

voice-control audio-analysis speech-detection command-recognition sound-monitoring
No License Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 6 / 25
Maturity 8 / 25
Community 15 / 25

Stars: 19
Forks: 5
Language: Python
License: none
Last pushed: Sep 27, 2021
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/ryuuji06/keyword-spotting"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
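The same request can be sketched in Python using only the standard library. The helper names `quality_url` and `fetch_quality` are hypothetical, and reading the path segments as category, owner, and repository is an assumption inferred from the single URL shown above. The response format is not documented here, so the body is returned as raw text rather than parsed.

```python
from urllib.parse import quote
from urllib.request import urlopen

# Base path copied from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL; the segment meaning is an assumption."""
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0) -> str:
    """Fetch the raw response body as text (keyless access, per the note above)."""
    with urlopen(quality_url(category, owner, repo), timeout=timeout) as response:
        return response.read().decode("utf-8")


url = quality_url("voice-ai", "ryuuji06", "keyword-spotting")
```

Calling `fetch_quality("voice-ai", "ryuuji06", "keyword-spotting")` performs the network request, which counts against the daily request limit.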