goepfert/audio_features
Speech Recognition and Voice Activity Detection using a Convolutional Neural Network Architecture built with Tensorflow.js
This tool helps you build and train custom voice-controlled applications directly in a web browser. It takes your spoken words and background sounds as input, and outputs trained models that can detect when someone is speaking and classify specific spoken phrases. It's designed for anyone needing a simple, browser-based way to create custom speech recognition for specific commands or sounds.
No commits in the last 6 months.
Use this if you need a quick, client-side way to recognize a small set of predefined voice commands or detect speech activity without server-side processing.
Not ideal if you need to recognize a wide vocabulary, process continuous speech, or require enterprise-grade accuracy and robustness for critical applications.
Stars
13
Forks
4
Language
JavaScript
License
—
Category
Last pushed
Oct 24, 2021
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/goepfert/audio_features"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
aqibsaeed/Urban-Sound-Classification
Urban sound classification using Deep Learning
spotify/realbook
Easier audio-based machine learning with TensorFlow.
ArmDeveloperEcosystem/ml-audio-classifier-example-for-pico
ML Audio Classifier Example for Pico 🔊🔥🔔
mimbres/neural-audio-fp
Official implementation of Neural Audio Fingerprint (ICASSP 2021)
f4exb/morseangel
Deep Neural Network for Morse decoding