narVidhai/Speech-Transcription-Benchmarking
Example python scripts to evaluate various ASR methods
This project helps you transcribe large batches of audio files into text using popular speech-to-text services like Google, AWS, and Microsoft. You provide a folder of .wav audio files, and it returns a corresponding folder of .txt transcription files. This is ideal for researchers, data analysts, or anyone needing to convert spoken content from many audio recordings into written form for further analysis.
No commits in the last 6 months.
Use this if you have a collection of WAV audio files and need to quickly get accurate text transcriptions using commercial speech-to-text APIs, or if you want to compare the performance of different transcription services.
Not ideal if you need a free, offline, or local transcription solution, or if your audio files are not in WAV format and require preprocessing.
Stars
11
Forks
1
Language
Python
License
MIT
Category
Last pushed
Dec 22, 2021
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/narVidhai/Speech-Transcription-Benchmarking"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
kahne/fastwer
A PyPI package for fast word/character error rate (WER/CER) calculation
analyticsinmotion/werpy
🐍📦 Ultra-fast Python package for calculating and analyzing the Word Error Rate (WER). Built for...
fgnt/meeteval
MeetEval - A meeting transcription evaluation toolkit
tabahi/bournemouth-forced-aligner
Extract phoneme-level timestamps from speeh audio.
wq2012/SimpleDER
A lightweight library to compute Diarization Error Rate (DER).