SEPIA-Framework/sepia-stt-server
SEPIA server to support open-source speech recognition via WebSocket connection.
This project provides a server for real-time speech-to-text conversion, enabling applications to convert spoken audio into written text instantly. It takes a live stream of audio as input and outputs transcribed text. This is useful for developers who need to integrate automatic speech recognition into their applications, particularly for voice assistants, transcription services, or IoT devices.
136 stars. No commits in the last 6 months.
Use this if you need to add real-time, open-source speech recognition capabilities to an application or device, especially if you require support for multiple ASR engines or deployment on single-board computers.
Not ideal if you are a non-technical user looking for a ready-to-use speech-to-text application rather than a server to build upon.
Stars
136
Forks
23
Language
Python
License
MIT
Category
Last pushed
Nov 07, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/SEPIA-Framework/sepia-stt-server"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
shibing624/parrots
Automatic Speech Recognition(ASR), Text-To-Speech(TTS) engine. 中英语音识别、多角色语音合成,支持多语言,准确率高
altunenes/parakeet-rs
very fast speech-to-text, diarization, streaming (even in CPU) with NVIDIA Parakeet in Rust
MainRo/deepspeech-server
A testing server for a speech to text service based on coqui.ai
thewh1teagle/pyannote-rs
pyannote audio diarization in rust
PaddlePaddle/Parakeet
PAddle PARAllel text-to-speech toolKIT (supporting Tacotron2, Transformer TTS,...