Picovoice/speech-to-text-benchmark

speech to text benchmark framework

/ 100

Established

This tool helps developers and machine learning engineers compare the performance of different speech-to-text engines. It takes audio datasets and chosen speech-to-text engines as input, then outputs detailed metrics like Word Error Rate, Punctuation Error Rate, processing efficiency (Core-Hour), and real-time responsiveness (Word Emission Latency). This allows a technical professional to objectively evaluate which engine best suits their application's accuracy, speed, and resource requirements.

683 stars. Actively maintained with 1 commit in the last 30 days.

Use this if you need to quantitatively compare the accuracy and efficiency of various speech-to-text services and models for your development project.

Not ideal if you are an end-user simply looking to transcribe audio without needing to benchmark different underlying technologies.

speech-recognition natural-language-processing voice-ai model-evaluation software-development

No Package No Dependents

Maintenance 13 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 18 / 25

How are scores calculated?

Stars

683

Forks

Language

Python

License

Apache-2.0

Related tools

alphacep/vosk-api

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

huggingface/speech-to-speech

Build local voice agents with open-source models

linto-ai/WebVoiceSDK

Buildings block for voice-enabled applications in the browser

vox-serve/vox-serve

A Streaming-Native Serving Engine for TTS/STS Models

Lex-au/Orpheus-FastAPI

High-performance Text-to-Speech server with OpenAI-compatible API, 8 voices, emotion tags, and...

Explore Voice AI Tools

All categories Trending Voice AI directory Insights