amirharati/kaldi-alligner

scripts to align a given wave to its transcription using trained models by Kaldi

/ 100

Emerging

This tool helps speech researchers and linguists precisely match spoken words in an audio file to their written transcriptions. You provide an audio recording and its text, and it outputs a detailed timeline showing exactly when each word, and even sounds like laughter or noise, occurs in the audio. It's for anyone needing accurate timing of speech components for analysis.

No commits in the last 6 months.

Use this if you need to create highly accurate, time-stamped transcriptions of audio recordings, including non-speech events, for detailed linguistic or phonetic analysis.

Not ideal if you need real-time transcription or if you're looking for a simple, off-the-shelf speech-to-text service without needing fine-grained temporal alignment.

speech-analysis linguistics phonetics audio-transcription discourse-analysis

No License Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 7 / 25

Maturity 8 / 25

Community 16 / 25

How are scores calculated?

Stars

Forks

Language

Shell

License

—

Higher-rated alternatives

daanzu/kaldi-active-grammar

Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time

gooofy/py-kaldi-asr

Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as...

nttcslab-sp/kaldiio

A pure python module for reading and writing kaldi ark files

pykaldi/pykaldi

A Python wrapper for Kaldi

kaldi-asr/kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.

Explore Voice AI Tools

All categories Trending Voice AI directory Insights