revdotcom/fstalign

An efficient OpenFST-based tool for calculating WER and aligning two transcript sequences.

/ 100

Emerging

This tool helps speech-to-text transcriptionists and NLP engineers evaluate and refine automated speech recognition (ASR) output. It takes a reference transcript (the ground truth) and a hypothesis transcript (the ASR output), calculates the Word Error Rate (WER), and aligns the two sequences. The alignment pinpoints where the ASR system made errors (insertions, deletions, or substitutions) and helps reveal the nature of those errors.
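To make the WER and alignment idea concrete, here is a minimal Python sketch. It is not fstalign's implementation (the tool is written in C++ on top of OpenFST); it swaps in a plain Levenshtein dynamic program, and the function names `align` and `wer` are hypothetical, chosen only for illustration. It assumes both transcripts are already tokenized into words.

```python
def align(ref, hyp):
    """Align two token lists by minimum edit distance (plain Levenshtein).

    Returns a list of (op, ref_token, hyp_token) tuples, where op is
    'ok', 'sub', 'del' (token missing from hyp) or 'ins' (extra token in hyp).
    """
    n, m = len(ref), len(hyp)
    # cost[i][j] = edit distance between ref[:i] and hyp[:j]
    cost = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        cost[i][0] = i
    for j in range(1, m + 1):
        cost[0][j] = j
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            diag = cost[i - 1][j - 1] + (ref[i - 1] != hyp[j - 1])
            cost[i][j] = min(diag, cost[i - 1][j] + 1, cost[i][j - 1] + 1)

    # Walk back from the bottom-right corner to recover one optimal alignment.
    ops = []
    i, j = n, m
    while i > 0 or j > 0:
        if (i > 0 and j > 0
                and cost[i][j] == cost[i - 1][j - 1] + (ref[i - 1] != hyp[j - 1])):
            op = "ok" if ref[i - 1] == hyp[j - 1] else "sub"
            ops.append((op, ref[i - 1], hyp[j - 1]))
            i, j = i - 1, j - 1
        elif i > 0 and cost[i][j] == cost[i - 1][j] + 1:
            ops.append(("del", ref[i - 1], None))
            i -= 1
        else:
            ops.append(("ins", None, hyp[j - 1]))
            j -= 1
    ops.reverse()
    return ops


def wer(ref, hyp):
    """Word Error Rate: (substitutions + deletions + insertions) / len(ref)."""
    if not ref:
        raise ValueError("reference transcript must not be empty")
    errors = sum(1 for op, _, _ in align(ref, hyp) if op != "ok")
    return errors / len(ref)


if __name__ == "__main__":
    reference = "the cat sat on the mat".split()
    hypothesis = "the cat sat on mat today".split()
    for op, r, h in align(reference, hypothesis):
        print(op, r, h)
    print(round(wer(reference, hypothesis), 3))
```

In the usage example, the hypothesis differs from the six-word reference by two edits, so the WER is 2/6. When several alignments share the same minimum cost, this sketch prefers substitutions during the backtrace; a production tool may break ties differently.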

171 stars.

Use this if you need to accurately measure the performance of a speech-to-text system and understand specific alignment differences between human-generated and machine-generated transcripts.

Not ideal if you are looking for a general-purpose text comparison tool or a system that works with non-sequential or heavily structured data.

speech-to-text transcription-analysis ASR-evaluation natural-language-processing audio-transcription

No package. No dependents.

Maintenance 10 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 10 / 25

Language: C++

License: Apache-2.0

Higher-rated alternatives

daanzu/kaldi-active-grammar

Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time

gooofy/py-kaldi-asr

Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as...

nttcslab-sp/kaldiio

A pure python module for reading and writing kaldi ark files

pykaldi/pykaldi

A Python wrapper for Kaldi

kaldi-asr/kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.
