MainRo/deepspeech-server

A testing server for a speech-to-text service based on coqui.ai

58 / 100 Established

This project provides a simple way to test speech-to-text accuracy using Coqui STT models. You input an audio file (like a WAV) and it outputs the transcribed text. It's designed for anyone needing to evaluate the performance of different speech recognition models for their applications or research.
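A minimal client sketch for the input/output flow described above, using only the Python standard library. The host, port, and `/stt` route are assumptions that depend on how the server is configured, and the helper names `build_stt_request` and `transcribe` are hypothetical. The response is treated as plain text, which is also an assumption.

```python
import urllib.request

# Assumption: the server listens on localhost:8080 and exposes an /stt route.
# Both values depend on the server's configuration; adjust them as needed.
STT_URL = "http://localhost:8080/stt"


def build_stt_request(wav_bytes: bytes, url: str = STT_URL) -> urllib.request.Request:
    """Build a POST request that carries raw WAV bytes as the request body."""
    return urllib.request.Request(
        url,
        data=wav_bytes,
        headers={"Content-Type": "audio/wav"},
        method="POST",
    )


def transcribe(wav_path: str, url: str = STT_URL, timeout: float = 60.0) -> str:
    """Send a WAV file to the server and return the response body as text."""
    with open(wav_path, "rb") as wav_file:
        request = build_stt_request(wav_file.read(), url)
    # Assumption: the transcription comes back as UTF-8 text in the body.
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return response.read().decode("utf-8")
```

With a server running, `transcribe("sample.wav")` would return whatever text the server sends back for that file; `sample.wav` is a placeholder name.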

219 stars. No commits in the last 6 months. Available on PyPI.

Use this if you need a straightforward HTTP server to assess how well various Coqui STT models convert spoken audio into text.
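One common way to quantify how well a model converts audio into text is word error rate (WER): the word-level edit distance between a reference transcript and the model output, divided by the number of reference words. The sketch below is a generic illustration and not something this project is known to ship; the function name `word_error_rate` and the lowercase, whitespace-split normalization are assumptions.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Return the word-level edit distance divided by the reference length."""
    # Assumption: lowercase and split on whitespace; real evaluations often
    # also strip punctuation or normalize numbers.
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference transcript must contain at least one word")

    # prev[j] holds the edit distance between the reference words processed
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,                      # deletion
                curr[j - 1] + 1,                  # insertion
                prev[j - 1] + substitution_cost,  # match or substitution
            )
        prev = curr
    return prev[-1] / len(ref)
```

Comparing this value across models on the same set of recordings gives a rough ranking; lower is better, and values above 1.0 are possible when the output contains many extra words.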

Not ideal if you're looking for a production-ready, highly scalable speech-to-text service or a tool for custom model training.

speech-recognition audio-transcription model-evaluation voice-technology natural-language-processing
Stale 6m No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 25 / 25
Community 23 / 25


Stars 219
Forks 70
Language Python
License MPL-2.0
Last pushed Jul 12, 2022
Commits (30d) 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/MainRo/deepspeech-server"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
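
The same request can be sketched in Python with the standard library. The URL layout (category, then owner, then repository) is inferred from the single curl example above, and the response is assumed to be JSON; neither is confirmed here. The helper names `quality_url` and `fetch_quality` are hypothetical.

```python
import json
import urllib.parse
import urllib.request

API_BASE = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL, assuming a category/owner/repo path layout."""
    parts = [urllib.parse.quote(part, safe="") for part in (category, owner, repo)]
    return f"{API_BASE}/{'/'.join(parts)}"


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 30.0):
    """Fetch the quality record and parse it, assuming a JSON response body."""
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.load(response)
```

Calling `fetch_quality("voice-ai", "MainRo", "deepspeech-server")` issues the same GET request as the curl command and counts against the daily request limit.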