Kaldi ASR Ecosystem Voice AI Tools

Tools, recipes, models, and utilities built on or for the Kaldi ASR framework, including language-specific implementations, format converters, and training pipelines. Does NOT include non-Kaldi ASR systems, general speech recognition APIs, or TTS tools.

There are 69 Kaldi ASR ecosystem tools tracked. 10 score 50 or higher (Established tier). The highest-rated is daanzu/kaldi-active-grammar at 61/100 with 347 stars.

Get all 69 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=voice-ai&subcategory=kaldi-asr-ecosystem&limit=20"

Open to everyone: 100 requests/day, no key needed. A free key raises the limit to 1,000/day.
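The same request can be issued from Python with only the standard library. This is a minimal sketch, not an official client: the endpoint and the `domain`, `subcategory`, and `limit` parameters are copied from the curl command above, while the helper names `build_url` and `fetch_projects` are hypothetical. The response is assumed to be JSON, and its exact shape is not documented here, so the sketch returns the parsed payload without interpreting it. Whether `limit` accepts values larger than 20 is also an assumption to verify against the API.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Endpoint taken from the curl command on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(domain="voice-ai", subcategory="kaldi-asr-ecosystem", limit=20):
    """Build the request URL (hypothetical helper; parameters mirror the curl example)."""
    query = urlencode({"domain": domain, "subcategory": subcategory, "limit": limit})
    return f"{BASE_URL}?{query}"


def fetch_projects(url, timeout=30):
    """Fetch the URL and parse the body as JSON (assumes a JSON response)."""
    with urlopen(url, timeout=timeout) as response:
        return json.load(response)


# Example (performs a network request and counts against the daily quota):
# data = fetch_projects(build_url())
# print(json.dumps(data, indent=2)[:500])
```

Keeping URL construction separate from the network call makes it easy to check the query string before spending any of the 100 keyless requests per day.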

| # | Tool | Description | Score (/100) | Tier |
|---|------|-------------|--------------|------|
| 1 | daanzu/kaldi-active-grammar | Python Kaldi speech recognition with grammars that can be set... | 61 | Established |
| 2 | gooofy/py-kaldi-asr | Some simple wrappers around kaldi-asr intended to make using kaldi's... | 58 | Established |
| 3 | nttcslab-sp/kaldiio | A pure python module for reading and writing kaldi ark files | 57 | Established |
| 4 | pykaldi/pykaldi | A Python wrapper for Kaldi | 57 | Established |
| 5 | kaldi-asr/kaldi | kaldi-asr/kaldi is the official location of the Kaldi project. | 53 | Established |
| 6 | scarletcho/KoLM | Korean text normalization and language preparation package for LM in... | 52 | Established |
| 7 | alumae/kaldi-gstreamer-server | Real-time full-duplex speech recognition server, based on the Kaldi toolkit... | 51 | Established |
| 8 | jcsilva/docker-kaldi-gstreamer-server | Dockerfile for kaldi-gstreamer-server. | 51 | Established |
| 9 | alumae/gst-kaldi-nnet2-online | GStreamer plugin around Kaldi's online neural network decoder | 50 | Established |
| 10 | goodatlas/zeroth | Kaldi-based Korean ASR (한국어 음성인식) open-source project | 50 | Established |
| 11 | ARBML/klaam | Arabic speech recognition, classification and text-to-speech. | 49 | Emerging |
| 12 | XiaoMi/kaldi-onnx | Kaldi model converter to ONNX | 48 | Emerging |
| 13 | alumae/kaldi-offline-transcriber | Offline transcription system for Estonian using Kaldi | 48 | Emerging |
| 14 | YoavRamon/awesome-kaldi | This is a list of features, scripts, blogs and resources for better using... | 47 | Emerging |
| 15 | dspavankumar/keras-kaldi | Keras Interface for Kaldi ASR | 47 | Emerging |
| 16 | jimbozhang/kaldi-gop | Kaldi-based goodness of pronunciation (GOP) | 47 | Emerging |
| 17 | revdotcom/fstalign | An efficient OpenFST-based tool for calculating WER and aligning two... | 46 | Emerging |
| 18 | skit-ai/kaldi-serve | Server framework for Kaldi ASR Toolkit | 45 | Emerging |
| 19 | opensource-spraakherkenning-nl/Kaldi_NL | Code related to the Dutch instance and user groups of the KALDI speech... | 43 | Emerging |
| 20 | collectivat/cmusphinx-models | Acoustic and language models for minorised languages. | 39 | Emerging |
| 21 | uiuc-sst/asr24 | 24-hour Automatic Speech Recognition | 39 | Emerging |
| 22 | loretoparisi/htk | HTK Toolkit with Linux 64 bit and Docker support | 39 | Emerging |
| 23 | srinivr/kaldi-long-audio-alignment | Long audio alignment using Kaldi | 39 | Emerging |
| 24 | falabrasil/kaldi-br | ☕🇧🇷 Scripts for Kaldi in Brazilian Portuguese | 38 | Emerging |
| 25 | scarletcho/prep4kaldi | Data preparation code for building Kaldi ASR system | 38 | Emerging |
| 26 | daanzu/kaldi_ag_training | Docker image and scripts for training finetuned or completely personal Kaldi... | 37 | Emerging |
| 27 | lars76/forced-alignment-chinese | Mandarin Chinese audio datasets aligned with Montreal Forced Aligner | 37 | Emerging |
| 28 | jcsilva/docker-kaldi-android | Dockerfile for compiling Kaldi for Android. | 36 | Emerging |
| 29 | Ma-Dan/asr-decode | A lightweight speech recognition decoding and inference framework trimmed down from Kaldi; currently implements MFCC+GMM+Viterbi, with no dependency on libraries such as OpenFST or OpenBLAS | 36 | Emerging |
| 30 | hmeutzner/kaldi-avsr | Kaldi-based audio-visual speech recognition | 36 | Emerging |
| 31 | Hamahmi/kaldi-tut | This is a Kaldi tutorial for beginners | 36 | Emerging |
| 32 | m1el/nemotron-asr.cpp | Nemotron ASR rewrite to GGML | 33 | Emerging |
| 33 | Anwarvic/Arabic-Speech-Recognition | This repository contains my attempt to use two famous speech recognition... | 32 | Emerging |
| 34 | ZoraizQ/urdu-speech-recognition | Urdu Speech Recognition using Kaldi ASR, by training Triphone Acoustic GMMs... | 32 | Emerging |
| 35 | srvk/srvk-eesen-offline-transcriber | Top level code to transcribe English audio/video files into text/subtitles | 32 | Emerging |
| 36 | SethiPawandeep/kaldi-for-dummies | This is the repository for my version of Kaldi for Dummies example. | 31 | Emerging |
| 37 | german-asr/kaldi-german | Scripts for training Kaldi for German speech recognition (ASR). | 31 | Emerging |
| 38 | pigzach/MagicSpeechASR | magicspeech competition recipe | 31 | Emerging |
| 39 | amirharati/kaldi-alligner | scripts to align a given wave to its transcription using trained models by Kaldi | 31 | Emerging |
| 40 | mcw519/Brownie | Post processing for speech recognition | 31 | Emerging |
| 41 | tsengia/SphinxTrainHelper | A Bash script designed to make training sphinx4 and pocketsphinx acoustic... | 31 | Emerging |
| 42 | falabrasil/cmusphinx-br | Scripts and resources for ASR in Brazilian Portuguese | 31 | Emerging |
| 43 | jailuthra/asr | Kaldi ASR wrapper scripts | 30 | Emerging |
| 44 | t13m/kaldi-readers-for-tensorflow | readers that enable reading kaldi ark in tensorflow | 30 | Emerging |
| 45 | lyncisdev/voco | Create a speech recognition system for programming by voice using Kaldi | 29 | Experimental |
| 46 | tifaniwarnita/indonesian-asr | Automatic speech recognition (ASR) for Indonesian language built by using... | 29 | Experimental |
| 47 | aalto-speech/finnish-parliament-scripts | Scripts for retrieving and aligning speech and meeting transcripts from the... | 29 | Experimental |
| 48 | mvshyvk/KaldiService | Service for easy access to speech recognition capabilities of Kaldi using... | 28 | Experimental |
| 49 | FarawaySail/Kaldi_thchs30 | Speech recognition course project for Media and Cognition | 28 | Experimental |
| 50 | bagustris/id | Iban-based Kaldi recipe for Indonesian speech Corpus, presented at ASJ Spring 2019. | 27 | Experimental |
| 51 | JarbasAl/pocketsphinx-models-mirror | pocketsphinx models for languages originating from the iberian peninsula | 27 | Experimental |
| 52 | mathquis/node-kaldi-online-nnet3-decoder | ASR online decoding using Kaldi NNet3 GrammarFST | 27 | Experimental |
| 53 | Agrover112/Goodness-of-Pronunciation-Pipelines-for-OOV-Problem | Goodness of Pronunciation Pipelines for OOV Removal | 27 | Experimental |
| 54 | conbitin/htk3.5-install | Installation steps of HTK 3.5 under Ubuntu | 25 | Experimental |
| 55 | synesthesiam/pt-synesthesiam | CMU Sphinx acoustic model for Portuguese (pt-br) | 24 | Experimental |
| 56 | keymastervn/htksupport | Minimal HTK for supporting HTK in Vietnamese. | 23 | Experimental |
| 57 | sidgupta234/Indian_English_ASR | An Indian English ASR system based on Hidden Markov Models (HMM) has been... | 22 | Experimental |
| 58 | jerrykuo7727/ASR-common-voice-zh-tw | HMM-based ASR systems trained on CommonVoice(zh-TW) using Kaldi. | 22 | Experimental |
| 59 | burrmill/burrmill | BurrMill core | 22 | Experimental |
| 60 | alx741/kaldi_spanish_dimex100 | Kaldi ASR Spanish example using the DIMEx100 corpus | 22 | Experimental |
| 61 | asrajeh/kaldi-arabic | HMM-based Arabic ASR using Kaldi engine | 21 | Experimental |
| 62 | sasivatsal7122/Ckrett-package-pypi | a very basic ciphering/deciphering tool | 20 | Experimental |
| 63 | lormaechea/kaldi-grammar-compiler | A minimal tool that helps transforming fixed grammars into compiled Finite... | 19 | Experimental |
| 64 | cassiotbatista/asr-remote | TV Remote Control via Offline Speech Recognition | 18 | Experimental |
| 65 | falabrasil/espnet-br | 📍🇧🇷 Scripts for ESPnet in Brazilian Portuguese | 18 | Experimental |
| 66 | falabrasil/htk-br | Scripts for training acoustic models | 17 | Experimental |
| 67 | tjysdsg/aidatatang_force_align | Perform force alignment on Mandarin data using aidatatang pretrained model... | 12 | Experimental |
| 68 | Agrover112/Kaldi-notes | Resources helpful for Kaldi | 12 | Experimental |
| 69 | cadia-lvl/althingi-asr | An ASR recipe and speech corpus of Icelandic parliamentary speeches | 12 | Experimental |
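
The tier labels in the listing above appear to follow fixed score cut-offs: every Established entry scores 50 or higher, Emerging entries fall between 30 and 49, and Experimental entries score 29 or lower. The sketch below encodes that observation. The thresholds are inferred from the 69 rows shown here, not taken from a documented scoring rule, and the function name `tier_for_score` is hypothetical; tiers above Established may exist in other categories and are not modeled.

```python
def tier_for_score(score):
    """Map a 0-100 quality score to a tier label.

    Cut-offs are inferred from the listing on this page (assumption),
    not from an official specification.
    """
    if score >= 50:
        return "Established"
    if score >= 30:
        return "Emerging"
    return "Experimental"


# Spot checks against rows in the listing.
print(tier_for_score(61))  # daanzu/kaldi-active-grammar
print(tier_for_score(30))  # jailuthra/asr
print(tier_for_score(29))  # lyncisdev/voco
```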