Keyword Speech Recognition Voice AI Tools
Machine learning models for recognizing isolated spoken words/commands from audio using CNNs, RNNs, and neural networks. Does NOT include continuous speech-to-text ASR, end-to-end speech recognition pipelines, or general audio classification beyond single-word detection.
There are 126 keyword speech recognition tools tracked. 1 score above 50 (established tier). The highest-rated is julius-speech/julius at 51/100 with 1,930 stars.
Get all 126 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=voice-ai&subcategory=keyword-speech-recognition&limit=20"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
| # | Tool | Score | Tier |
|---|---|---|---|
| 1 |
julius-speech/julius
Open-Source Large Vocabulary Continuous Speech Recognition Engine |
|
Established |
| 2 |
rolczynski/Automatic-Speech-Recognition
🎧 Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow) |
|
Emerging |
| 3 |
tabahi/formantfeatures
Extract frequency, power, width and dissonance of formants from wav files |
|
Emerging |
| 4 |
libdriver/ld3320
LD3320 full-featured driver library for general-purpose MCU and Linux. |
|
Emerging |
| 5 |
awsaf49/audio_classification_models
Tensorflow Audio Classification Models |
|
Emerging |
| 6 |
shenasa-ai/speech2text
A Deep-Learning-Based Persian Speech Recognition System |
|
Emerging |
| 7 |
subho406/TF-Speech-Recognition-Challenge-Solution
Source code of the model used in Tensorflow Speech Recognition Challenge... |
|
Emerging |
| 8 |
xxbb1234021/speech_recognition
中文语音识别 |
|
Emerging |
| 9 |
stefantaubert/mel-cepstral-distance
A Python library for computing the Mel-Cepstral Distance (Mel-Cepstral... |
|
Emerging |
| 10 |
felixchenfy/Speech-Commands-Classification-by-LSTM-PyTorch
Classification of 11 types of audio clips using MFCCs features and LSTM.... |
|
Emerging |
| 11 |
MohammedRashad/FPGA-Speech-Recognition
Expiremental Speech Recognition System using VHDL & MATLAB. |
|
Emerging |
| 12 |
kamilc/speech-recognition
Companion repository for the blog article:... |
|
Emerging |
| 13 |
AkojimaSLP/Beamforming-for-speech-enhancement
simple delaysum, MVDR and CGMM-MVDR |
|
Emerging |
| 14 |
supikiti/PNCC
A implementation of Power Normalized Cepstral Coefficients: PNCC |
|
Emerging |
| 15 |
tugstugi/pytorch-speech-commands
Speech commands recognition with PyTorch | Kaggle 10th place solution in... |
|
Emerging |
| 16 |
Sciss/SpeechRecognitionHMM
Exported from... |
|
Emerging |
| 17 |
zhihanyang2022/gender-audio-classification
A speaker gender classifier. MFC feature engineering and a pre-trained... |
|
Emerging |
| 18 |
hamzaehsan97/Speech_Recognition_CNN
CNN (Convolutional Neural Networks) Speech Recognition |
|
Emerging |
| 19 |
SkyDocs/speaker-identification
Speaker Identification using Neural Net. |
|
Emerging |
| 20 |
yh1008/speech-to-text
mixlingual speech recognition system; hybrid (GMM+NNet) model; Kaldi + Keras |
|
Emerging |
| 21 |
wblgers/hmm_speech_recognition_demo
A demo for simple isolated Chinese speech word recognition using GMMHMM in Python |
|
Emerging |
| 22 |
lucko515/Speech-commands-recognition
Recognizing common speech commands using Keras and Tensorflow. |
|
Emerging |
| 23 |
placebokkk/e6870
assignments for e6870 ASR class |
|
Emerging |
| 24 |
Ralireza/spoken-digit-recognition
Classifying English spoken digit by Hidden Markov Model |
|
Emerging |
| 25 |
AkojimaSLP/Frame-by-frame-closed-form-update-for-mask-based-adaptive-MVDR-beamforming
speech-enhacement |
|
Emerging |
| 26 |
cosmoquester/speech-recognition
Develop speech recognition models with Tensorflow 2 |
|
Emerging |
| 27 |
msalhab96/SpeeQ
A framework for automatic speech recognition |
|
Emerging |
| 28 |
gogyzzz/beamformit_matlab
A MATLAB implementation of CHiME4 baseline Beamformit |
|
Emerging |
| 29 |
creafz/kaggle-speech-recognition
Solution for TensorFlow Speech Recognition Challenge on Kaggle (125th place, top 10%) |
|
Emerging |
| 30 |
HristovB/Speech_Recognition_Macedonian
Speech recognition model for recognising Macedonian spoken language. |
|
Emerging |
| 31 |
Pooventhiran/VSR
Speaker-Independent Speech Recognition using Visual Features |
|
Emerging |
| 32 |
arthurfortes/speech2text_keras
This repository reports how to build a speech to text model to recognize... |
|
Emerging |
| 33 |
zhongyuchen/speech-classification
CNN and VGG speech classification with interactive website for testing |
|
Emerging |
| 34 |
super13/tensorflow-speech-recognition-pai
Speech recognition using tensorflow in aliyun pai. |
|
Emerging |
| 35 |
ShihabYasin/Isolated-Bengali-Word-and-Speaker-Recognition.
Isolated Bengali word and speaker recognition. |
|
Emerging |
| 36 |
ace19-dev/tensorflow-speech-recognition-challenge
Kaggle Competitions: TensorFlow Speech Recognition Challenge |
|
Emerging |
| 37 |
ShoYamanishi/AndroidMFCC
26-Point MFCC & 512-Point FFT Generator & Visualizer in Java, C++, and NEON... |
|
Emerging |
| 38 |
guglielmocamporese/learning_invariances_in_speech_recognition
In this work I investigate the speech command task developing and analyzing... |
|
Emerging |
| 39 |
JaesungBae/Speech-Command-Recognition-with-Capsule-Network
Speech command recognition with capsule network & various NNs / KWS on... |
|
Emerging |
| 40 |
PiasRoY/Bangla-Spoken-Number-Recognition
recognizing spoken Bangla numbers using MFCCs and CNN. |
|
Emerging |
| 41 |
TCL606/Speech-Number-Recognition
基于数字信号处理的语音数字识别器 |
|
Emerging |
| 42 |
vinbhaskara/Digit-Speech-Recognition
Using MFCC features on Speech Signals to classify Digits after matching... |
|
Emerging |
| 43 |
theawless/sr-lib
Automatic Speech Recognition library for my BTech Project. |
|
Emerging |
| 44 |
backpropper/DNN-Activation-Brain
Code repository for Dissecting the DNN Brain for a Better Insight (ICASSP 2016) |
|
Emerging |
| 45 |
saztorralba/CNNWordReco
Code and scripts for training and testing isolated spoken word recognition... |
|
Emerging |
| 46 |
common-voice/our-voices-model-competition
Our Voices Competition |
|
Emerging |
| 47 |
gtiwari333/speech-recognition-java-hidden-markov-model-vq-mfcc
Automatically exported from... |
|
Emerging |
| 48 |
seyedsaleh/persian-speech-recognition
Simple word recognition using CNN on Raspberry Pi board 🗣 |
|
Emerging |
| 49 |
trungd/speech-recognition
experimental speech recognition library in tensorflow |
|
Emerging |
| 50 |
orbxball/DSP
2016 Autumn (105-1) -- Fundamentals of Digital Speech Signal Processing |
|
Emerging |
| 51 |
timkrebs/VoiceDetection
Speech Recognition implementation with MFCC and HMM |
|
Emerging |
| 52 |
aishoot/DTWSpeech
A simple application of DTW Algorithm in isolate word speech recognition. |
|
Emerging |
| 53 |
mhagglun/Speech-Recognition
Tensorflow implementation for Speech Recognition using Convolutional Neural... |
|
Experimental |
| 54 |
aleksandarbos/Sound-Recognition-Convo2D-Neural-Network
Tools: Python (OpenCV 3.0 + Keras lib-Convolution 2D Neural Network). Desc:... |
|
Experimental |
| 55 |
verrannt/snn_speechrec
Convolutional Spiking Neural Network to recognize speech utterances using... |
|
Experimental |
| 56 |
rwightman/pytorch-commands
Some PyTorch code for the Kaggle Speech Recognition Challenge |
|
Experimental |
| 57 |
shitian-ni/speech-recognition-transfer-learning
Speech command recognition DenseNet transfer learning from UrbanSound8k in... |
|
Experimental |
| 58 |
sangramsingnk/Audio-Feature-Extraction
In sound processing, the mel-frequency cepstrum (MFC) is a representation of... |
|
Experimental |
| 59 |
anicolson/matlab_feat
Functions for creating speech features in MATLAB. |
|
Experimental |
| 60 |
Lhx94As/Awesome-Spoken-Language-Identification
An awesome spoken LID repository. (Working in progress |
|
Experimental |
| 61 |
popcornell/MicRank
MicRank is a Learning to Rank neural channel selection framework where a DNN... |
|
Experimental |
| 62 |
aminul-huq/Speech-Command-Classification
Speech command classification on Speech-Command v0.02 dataset using PyTorch... |
|
Experimental |
| 63 |
zssloth/TF-Speech-Recognition
Speech Recognition Using Tensorflow |
|
Experimental |
| 64 |
rwightman/tensorflow-speech_commands
Speech commands training/models from TF repo adapted for speech commands Kaggle |
|
Experimental |
| 65 |
codersinthestorm/RecurrentNN_SpeechRecognition
A model based in Tensorflow to recognize words from the 30 word Speech... |
|
Experimental |
| 66 |
ivallesp/Xception1d
Xception1d implementation for audio categorization |
|
Experimental |
| 67 |
miguelangelnieto/DNN-Speech-Recognizer
Built a deep neural network that functions as part of an end-to-end... |
|
Experimental |
| 68 |
AmourWaltz/BayesLMs
Project of IEEE/ACM TASLP “Bayesian Neural Network Language Modeling for... |
|
Experimental |
| 69 |
cmaroti/speech_recognition
Convolutional Neural Network for Speech Recognition, implemented in Ms. Pacman game |
|
Experimental |
| 70 |
techbd123/SpeechRecognition
Bengali Speech Recognition |
|
Experimental |
| 71 |
wvangansbeke/Audio-Speech
Build a cross-talk canceler and a speech recognizer |
|
Experimental |
| 72 |
YoungloLee/tf2-speech-recognition-las
Tensorflow 2 Speech Recognition Code (LAS) |
|
Experimental |
| 73 |
raminnakhli/HMM-DNN-Speech-Recognition
This repository is a Python implementation of HMM-DNN model. |
|
Experimental |
| 74 |
sindhura-pv/lip-reading
In this project, visual speech recognition has been attempted using 2 major... |
|
Experimental |
| 75 |
vault-42/AIND_DNN_Speech_Recognizer
End-to-end speech to text recognition |
|
Experimental |
| 76 |
salehsargolzaee/Audio-Signal-Processing-and-Feature-Extraction
Feature extraction from audio signal (explained in Persian) |
|
Experimental |
| 77 |
FarzadForuozanfar/Speech-Recognition
I recorded 10 voices with the same words from myself and compared them with... |
|
Experimental |
| 78 |
Amiannn/Simple-HmmGmm
Simple HMM implementation |
|
Experimental |
| 79 |
OldBonhart/TensorFlow_Speech_Recognition_Challenge
TensorFlow Speech Recognition Challenge -... |
|
Experimental |
| 80 |
nilkanthshirodkar/Speech-Recognition-Using-HMM
Automatic Speech Recognition (ASR) system was implemented using the HMM... |
|
Experimental |
| 81 |
type-a/speechnet
Automatic Speech Recognition |
|
Experimental |
| 82 |
YuriyGuts/gdg-speech-classifier
A machine learning system that recognizes the word 'Google' in human speech... |
|
Experimental |
| 83 |
vinsis/speech-commands-recognition
Single word speech recognition using PyTorch |
|
Experimental |
| 84 |
Erfanafshar/speech-gender-detection
An audio signal processing project that detects speaker gender from recorded... |
|
Experimental |
| 85 |
SvenWientjes/SpeechRecognition
Classifying sound signals as Links, Midden or Rechts using features computed... |
|
Experimental |
| 86 |
FandosA/Speech_Recognition_Keras_TF
Project I carried out during my Machine Learning course in the Master. |
|
Experimental |
| 87 |
kevobt/speech-to-text
Speech recognition framework using keras |
|
Experimental |
| 88 |
IvanEvan/chinese-digital-speech-recognition
中文数字语音识别:识别类语音验证码的8位数字语音 |
|
Experimental |
| 89 |
samuelebh/CNN-Spoken-Digit-Classifier
Repository containing Python code of a classifier that recognizes spoken... |
|
Experimental |
| 90 |
uigiporc/icon-sr
Progetto di Ingegneria della conoscenza, autori: Porcelli Luigi, Nicolo Cucinotta. |
|
Experimental |
| 91 |
ragibson/MFCC-speech-recognition
Real-time speech recognition via "Mel-Frequency Cepstral Coefficients"... |
|
Experimental |
| 92 |
mradovic38/dtw-speech-recognition
Speech recognition system that uses feature extraction and dynamic time... |
|
Experimental |
| 93 |
khaykingleb/research-playground
Efficient ML/DL implementations across multiple domains with K3s multi-node... |
|
Experimental |
| 94 |
dannis999/trained_SpeechRecognition
此项目用于备份一个完整的中文语音识别环境,包括环境配置和预训练模型,以方便直接使用 |
|
Experimental |
| 95 |
Pchambet/tp-hmm-markov
Markov Chains and Hidden Markov Models: weather modeling with discrete... |
|
Experimental |
| 96 |
YoungloLee/tf2-speech-recognition-transformer
Tensorflow 2 Speech Recognition Code (Transformer) |
|
Experimental |
| 97 |
samimoftheworld/Voice-Activity-Detection-FInal-Project-work
this repository concedes my project work done in my bachelors |
|
Experimental |
| 98 |
skyradez/Speech-Recognition-using-Convolutional-Neural-Network
Tutorial on Speech Recognition using Convolutional Neural Network |
|
Experimental |
| 99 |
showman-sharma/speech_writing-recognition
We are given 2 different problems to solve. 1. Isolated spoken digit... |
|
Experimental |
| 100 |
mohammadnabia/Speech-recognition-HMM
This project focuses on building a speech recognition system for the Farsi... |
|
Experimental |
| 101 |
briansm-github/shipping_recognition
Training/test data and code fror speech recognition experiments using UK... |
|
Experimental |
| 102 |
yihong1120/Speech-Commands-Classification-LSTM
A TensorFlow project for classifying speech commands using LSTM neural... |
|
Experimental |
| 103 |
shun60s/Wave-DNN-likelihood
音声認識エンジンJuliusのディクテーションキットに含まれるDNN-HMMモデルを利用して対数尤度を計算するpython |
|
Experimental |
| 104 |
belambert/cl-mfcc
MFCC feature computation |
|
Experimental |
| 105 |
trungrockyngo/GMM-speech-recognizer
Final project for CSCI 201 - Machine Learning |
|
Experimental |
| 106 |
VictorAtPL/Speech_Commands_Recognition_Bi_LSTM_with_Tensorflow_2
Neural Network with Bidirectional Long Short-Term Memory block for... |
|
Experimental |
| 107 |
inspektral/audioMNIST-classifier
simple CNN on MFCC for Audio MNIST classification |
|
Experimental |
| 108 |
alainnguema/SpeechLangID-GMM
Ce projet implémente un système de détection de langue capable d'identifier... |
|
Experimental |
| 109 |
Vujavujavuja/Pametna-Kuca-hub
A straightforward deep learning pipeline for audio classification using a... |
|
Experimental |
| 110 |
AdityaKshettri/Speech_Recognition_Using_MATLAB
Implementation of Speech Recognition System in MATLAB Environment using... |
|
Experimental |
| 111 |
jefflai108/scale
Some of my public work at https://hltcoe.jhu.edu/research/scale/scale-2017/ |
|
Experimental |
| 112 |
Moeinh77/voice-command-classification-Keras
command classification using Keras |
|
Experimental |
| 113 |
tuanio/audio-classification
Audio Classification with AlexNet and Speech Commands dataset |
|
Experimental |
| 114 |
tjysdsg/speech-recognition
GMM-HMM Continuous ASR Using Python and Numpy |
|
Experimental |
| 115 |
hakula139/naive-speech-recognizer
A naive speech recognizer from scratch, written in Python 3 |
|
Experimental |
| 116 |
parham1998/Isolated-Digits-Recognition
Implementation of Persian Isolated-Digits Recognition with Matlab |
|
Experimental |
| 117 |
rachelwiles/HMM-Speech-Recognition
Training a hidden Markov model through expectation-maximization, using... |
|
Experimental |
| 118 |
chrisakroyd/kaggle-speech-recognition
Top 24% entry into the Kaggle Speech Recognition Challenge. |
|
Experimental |
| 119 |
princeedey/SPEECH-RECOGNITION-USING-CORRELATION
This repository contains the function code for identifying different set of... |
|
Experimental |
| 120 |
parthvadhadiya/TensorFlow-Speech-Recognition-Challenge
this repository contains end to end python script to train speech data... |
|
Experimental |
| 121 |
g1y5x3/Speech_Phone_Detection
Recognize base phones (/a/, /u/, /i/) from a given speech and indicate the... |
|
Experimental |
| 122 |
parvatijay2901/Footstep-Voice-Identification
MiiCare (Technical test): Detect the footstep |
|
Experimental |
| 123 |
GayathriRangu/DigitRecognition
This task of digit recognition is done using Hidden Markov Model using... |
|
Experimental |
| 124 |
tadakoglu/Speech-Recognition-with-MLP-and-Backpropagation-Algorithm
I have developed a simple speech recognition engine based on words using... |
|
Experimental |
| 125 |
Darnxca/BABELE-Riconoscimento-multilingua-senza-audio-tramite-PNN
This repository contains the university project for the Biometrics course.... |
|
Experimental |
| 126 |
sravi1210/Speech-Recognition
Speech Recognition System |
|
Experimental |