Ephrem-ETH/E2E-KWS
End-to-End Keyword Spotting (E2E-KWS) using a character level LSTM
This helps speech and audio engineers create highly accurate voice-activated systems. It takes raw audio data, such as recordings of spoken commands, and identifies specific keywords within that audio. The primary users are developers and researchers building applications that require reliable keyword spotting.
No commits in the last 6 months.
Use this if you need a robust, end-to-end solution for detecting specific keywords in audio with high accuracy, even with limited training data.
Not ideal if you're looking for a pre-trained, plug-and-play keyword spotter without any development or training, or if your primary need is full speech-to-text transcription.
Stars
43
Forks
8
Language
Python
License
—
Category
Last pushed
Nov 18, 2022
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/Ephrem-ETH/E2E-KWS"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
Picovoice/porcupine
On-device wake word detection powered by deep learning
MycroftAI/mycroft-precise
A lightweight, simple-to-use, RNN wake word listener
arcosoph/nanowakeword
A lightweight, open-source, and intelligent wake word detection engine. Train custom,...
mozilla/DeepSpeech
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run...
OAID/cortex-m-kws
Cortex M KWS example with Tengine Lite.