lars76/forced-alignment-chinese

Mandarin Chinese audio datasets aligned with Montreal Forced Aligner

Emerging

This project helps speech researchers and linguists time spoken Mandarin Chinese words and phonemes precisely within audio recordings. Given an audio file and its written Chinese transcript, it reports when each word, and each individual sound within it, begins and ends. Voice talent managers and language educators can use these timings to analyze speech patterns in detail.

No commits in the last 6 months.

Use this if you need to accurately pinpoint the start and end times of Chinese words and phonemes in audio recordings, especially for Mandarin speech analysis or language research.

Not ideal if you are working with languages other than Mandarin Chinese or if you only need rough, sentence-level timing rather than precise word and phoneme-level alignment.
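
To make the difference between word-level, phoneme-level, and sentence-level timing concrete, here is a minimal sketch in plain Python. It is not this project's code or output format: the `Interval` class, the helper functions, the timings, and the pinyin-like phone labels are all hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class Interval:
    """One aligned unit (hypothetical structure, not the project's format)."""
    label: str
    start: float  # seconds from the beginning of the recording
    end: float    # seconds from the beginning of the recording

    @property
    def duration(self) -> float:
        return self.end - self.start


def phones_within(word: Interval, phones: List[Interval], tol: float = 1e-6) -> List[Interval]:
    """Return the phone intervals that lie entirely inside the word's time span."""
    return [p for p in phones if p.start >= word.start - tol and p.end <= word.end + tol]


def sentence_span(words: List[Interval]) -> Tuple[float, float]:
    """Collapse word-level timing into a single rough sentence-level span."""
    return min(w.start for w in words), max(w.end for w in words)


# Made-up timings for a two-word utterance; real values come from an aligner.
words = [Interval("你好", 0.10, 0.58), Interval("世界", 0.58, 1.12)]
phones = [
    Interval("n", 0.10, 0.18), Interval("i", 0.18, 0.31),
    Interval("h", 0.31, 0.40), Interval("ao", 0.40, 0.58),
    Interval("sh", 0.58, 0.70), Interval("i", 0.70, 0.82),
    Interval("j", 0.82, 0.90), Interval("ie", 0.90, 1.12),
]

for w in words:
    labels = [p.label for p in phones_within(w, phones)]
    print(w.label, round(w.duration, 2), labels)
print("sentence:", sentence_span(words))
```

Sentence-level timing keeps only the outer span, while word- and phoneme-level alignment preserves every boundary inside it, which is the precision the listing describes.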

speech-research linguistics mandarin-analysis voice-processing language-education

Stale 6m, No Package, No Dependents

Maintenance 0 / 25

Adoption 6 / 25

Maturity 16 / 25

Community 15 / 25

Language

Python

License

MIT

Higher-rated alternatives

daanzu/kaldi-active-grammar

Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time

gooofy/py-kaldi-asr

Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as...

nttcslab-sp/kaldiio

A pure python module for reading and writing kaldi ark files

pykaldi/pykaldi

A Python wrapper for Kaldi

kaldi-asr/kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.
