narVidhai/Speech-Transcription-Benchmarking

Example python scripts to evaluate various ASR methods

/ 100

Experimental

This project helps you transcribe large batches of audio files into text using popular speech-to-text services like Google, AWS, and Microsoft. You provide a folder of .wav audio files, and it returns a corresponding folder of .txt transcription files. This is ideal for researchers, data analysts, or anyone needing to convert spoken content from many audio recordings into written form for further analysis.

No commits in the last 6 months.

Use this if you have a collection of WAV audio files and need to quickly get accurate text transcriptions using commercial speech-to-text APIs, or if you want to compare the performance of different transcription services.

Not ideal if you need a free, offline, or local transcription solution, or if your audio files are not in WAV format and require preprocessing.

audio-transcription research-analysis data-processing content-moderation qualitative-research

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 5 / 25

Maturity 16 / 25

Community 7 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

MIT

Featured in

Things AI Won't Tell You About Building a Voice App

Higher-rated alternatives

kahne/fastwer

A PyPI package for fast word/character error rate (WER/CER) calculation

analyticsinmotion/werpy

🐍📦 Ultra-fast Python package for calculating and analyzing the Word Error Rate (WER). Built for...

fgnt/meeteval

MeetEval - A meeting transcription evaluation toolkit

tabahi/bournemouth-forced-aligner

Extract phoneme-level timestamps from speeh audio.

wq2012/SimpleDER

A lightweight library to compute Diarization Error Rate (DER).

Explore Voice AI Tools

All categories Trending Voice AI directory Insights