Ma-Dan/asr-decode
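A lightweight speech recognition decoding and inference framework trimmed down from Kaldi. It currently implements MFCC + GMM + Viterbi and does not depend on libraries such as OpenFST or OpenBLAS.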
This tool helps developers who need to integrate basic speech recognition capabilities into their applications. It takes an audio file (like a .wav) and speech model files, then outputs the transcribed text. It's designed for software engineers building systems that need to process spoken language.
No commits in the last 6 months.
Use this if you are a developer needing a lightweight, self-contained module for basic speech-to-text conversion without external library dependencies.
Not ideal if you need advanced speech recognition features, support for many languages, or a production-ready API for end-users.
Stars: 22
Forks: 4
Language: C++
License: Apache-2.0
Category
Last pushed: Jul 31, 2021
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/Ma-Dan/asr-decode"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
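For callers who prefer a script to the curl command above, the same request can be sketched in Python using only the standard library. This is a minimal sketch under stated assumptions: the helper names `build_url` and `fetch_quality` are hypothetical, the path layout (category, owner, repository) is inferred from the single URL shown above, and the response is assumed to be JSON, which this page does not confirm.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path copied from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL in the same shape as the curl example.

    The category/owner/repo layout is inferred from that one URL
    (an assumption, not documented behavior).
    """
    parts = (quote(part, safe="") for part in (category, owner, repo))
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Send an unauthenticated GET request and parse the body.

    Assumes the body is JSON; json.loads raises an error if it is not.
    """
    url = build_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_quality("voice-ai", "Ma-Dan", "asr-decode")` requests the same URL as the curl command. Unauthenticated calls count against the 100 requests/day limit, so a script that polls many repositories may want to cache results.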
Higher-rated alternatives
daanzu/kaldi-active-grammar
Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time
gooofy/py-kaldi-asr
Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as...
nttcslab-sp/kaldiio
A pure python module for reading and writing kaldi ark files
pykaldi/pykaldi
A Python wrapper for Kaldi
kaldi-asr/kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.