MainRo/deepspeech-server
A testing server for a speech to text service based on coqui.ai
This project provides a simple way to test speech-to-text accuracy using Coqui STT models. You input an audio file (like a WAV) and it outputs the transcribed text. It's designed for anyone needing to evaluate the performance of different speech recognition models for their applications or research.
219 stars. No commits in the last 6 months. Available on PyPI.
Use this if you need a straightforward HTTP server to assess how well various Coqui STT models convert spoken audio into text.
Not ideal if you're looking for a production-ready, highly scalable speech-to-text service or a tool for custom model training.
Stars
219
Forks
70
Language
Python
License
MPL-2.0
Category
Last pushed
Jul 12, 2022
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/MainRo/deepspeech-server"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
shibing624/parrots
Automatic Speech Recognition(ASR), Text-To-Speech(TTS) engine. 中英语音识别、多角色语音合成,支持多语言,准确率高
altunenes/parakeet-rs
very fast speech-to-text, diarization, streaming (even in CPU) with NVIDIA Parakeet in Rust
thewh1teagle/pyannote-rs
pyannote audio diarization in rust
PaddlePaddle/Parakeet
PAddle PARAllel text-to-speech toolKIT (supporting Tacotron2, Transformer TTS,...
daanzu/deepspeech-websocket-server
Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environments