scarletcho/prep4kaldi
Data preparation code for building a Kaldi ASR system
This tool helps researchers and engineers prepare their speech data for building an Automatic Speech Recognition (ASR) system using Kaldi. It takes a collection of audio recordings and their corresponding transcriptions (either in plain text files or TextGrid format), organized by speaker, and generates the specific text files Kaldi needs to train a speech recognition model. This is for anyone creating custom ASR models from their own datasets.
No commits in the last 6 months.
Use this if you have a dataset of speech recordings and their transcriptions and need to format them correctly for building a Kaldi ASR system.
Not ideal if you are not using Kaldi for ASR development or if your speech data is not organized into speaker-specific folders with audio and text files.
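To illustrate the kind of output described above, here is a minimal sketch of turning speaker-organized recordings into Kaldi's standard data-directory tables (wav.scp, text, utt2spk, spk2utt). It is not prep4kaldi's own code. The function names, the `<speaker>/<name>.wav` plus `<name>.txt` layout, and the `speaker-name` utterance ID scheme are assumptions made for the example, and TextGrid input is not handled.

```python
from collections import defaultdict
from pathlib import Path


def build_kaldi_tables(corpus_root):
    """Collect table rows from <corpus_root>/<speaker>/<name>.wav + <name>.txt pairs.

    Hypothetical helper: the folder layout and ID scheme are assumptions.
    """
    wav_scp, text, utt2spk = {}, {}, {}
    spk2utt = defaultdict(list)
    speaker_dirs = sorted(p for p in Path(corpus_root).iterdir() if p.is_dir())
    for spk_dir in speaker_dirs:
        spk = spk_dir.name
        for wav in sorted(spk_dir.glob("*.wav")):
            txt = wav.with_suffix(".txt")
            if not txt.exists():
                continue  # skip audio that has no transcript file
            # Prefix the utterance ID with the speaker ID so that sorting
            # by utterance also groups utterances by speaker.
            utt = f"{spk}-{wav.stem}"
            wav_scp[utt] = str(wav.resolve())
            # Collapse all whitespace so each transcript fits on one line.
            text[utt] = " ".join(txt.read_text(encoding="utf-8").split())
            utt2spk[utt] = spk
            spk2utt[spk].append(utt)
    return wav_scp, text, utt2spk, dict(spk2utt)


def write_kaldi_dir(corpus_root, out_dir):
    """Write wav.scp, text, utt2spk and spk2utt into out_dir, sorted by key."""
    wav_scp, text, utt2spk, spk2utt = build_kaldi_tables(corpus_root)
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    tables = {
        "wav.scp": wav_scp,
        "text": text,
        "utt2spk": utt2spk,
        "spk2utt": {s: " ".join(sorted(u)) for s, u in spk2utt.items()},
    }
    for name, table in tables.items():
        lines = [f"{key} {value}\n" for key, value in sorted(table.items())]
        (out / name).write_text("".join(lines), encoding="utf-8")
    return out
```

Calling `write_kaldi_dir("corpus", "data/train")` (both paths are placeholders) would write the four files into `data/train`. Each line holds a key followed by its value, and the lines are sorted by key because Kaldi's tools expect sorted tables.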
Stars: 14
Forks: 9
Language: Python
License: GPL-3.0
Category:
Last pushed: Mar 18, 2017
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/scarletcho/prep4kaldi"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
Higher-rated alternatives
daanzu/kaldi-active-grammar
Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time
gooofy/py-kaldi-asr
Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as...
nttcslab-sp/kaldiio
A pure python module for reading and writing kaldi ark files
pykaldi/pykaldi
A Python wrapper for Kaldi
kaldi-asr/kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.