MainRo/deepspeech-server

A testing server for a speech to text service based on coqui.ai

/ 100

Established

This project provides a simple way to test speech-to-text accuracy using Coqui STT models. You input an audio file (like a WAV) and it outputs the transcribed text. It's designed for anyone needing to evaluate the performance of different speech recognition models for their applications or research.

219 stars. No commits in the last 6 months. Available on PyPI.

Use this if you need a straightforward HTTP server to assess how well various Coqui STT models convert spoken audio into text.

Not ideal if you're looking for a production-ready, highly scalable speech-to-text service or a tool for custom model training.

speech-recognition audio-transcription model-evaluation voice-technology natural-language-processing

Stale 6m No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 25 / 25

Community 23 / 25

How are scores calculated?

Stars

219

Forks

Language

Python

License

MPL-2.0

Related tools

shibing624/parrots

Automatic Speech Recognition(ASR), Text-To-Speech(TTS) engine. 中英语音识别、多角色语音合成，支持多语言，准确率高

altunenes/parakeet-rs

very fast speech-to-text, diarization, streaming (even in CPU) with NVIDIA Parakeet in Rust

thewh1teagle/pyannote-rs

pyannote audio diarization in rust

PaddlePaddle/Parakeet

PAddle PARAllel text-to-speech toolKIT (supporting Tacotron2, Transformer TTS,...

daanzu/deepspeech-websocket-server

Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environments

Explore Voice AI Tools

All categories Trending Voice AI directory Insights