SEPIA-Framework/sepia-stt-server

SEPIA server to support open-source speech recognition via WebSocket connection.


Emerging

This project provides a server for real-time speech-to-text conversion. It takes a live stream of audio over a WebSocket connection as input and outputs the transcribed text. This is useful for developers who need to integrate automatic speech recognition into their applications, particularly for voice assistants, transcription services, or IoT devices.
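As a rough illustration of what "a live stream of audio" means on the client side, the sketch below splits raw PCM audio into short, fixed-duration chunks, each of which a client could send as one binary WebSocket message. The function name `pcm_chunks`, the 16 kHz 16-bit mono format, and the 100 ms chunk size are assumptions made for this example. They are not taken from the server's documentation, and the server's actual message format is not shown here.

```python
from typing import Iterator


def pcm_chunks(pcm: bytes, sample_rate: int = 16000, sample_width: int = 2,
               chunk_ms: int = 100) -> Iterator[bytes]:
    """Yield consecutive chunks of mono PCM audio, each about chunk_ms long.

    Hypothetical helper: the defaults (16 kHz, 16-bit, 100 ms) are
    illustrative assumptions, not values required by any particular server.
    """
    bytes_per_chunk = sample_rate * sample_width * chunk_ms // 1000
    # Keep every chunk aligned to whole samples.
    bytes_per_chunk -= bytes_per_chunk % sample_width
    if bytes_per_chunk <= 0:
        raise ValueError("chunk_ms is too small for the given sample rate")
    for start in range(0, len(pcm), bytes_per_chunk):
        # The final chunk may be shorter than the others.
        yield pcm[start:start + bytes_per_chunk]


# One second of silence at 16 kHz, 16-bit mono (32000 bytes).
one_second = bytes(16000 * 2)
chunks = list(pcm_chunks(one_second))
```

In a real client, each chunk would be handed to a WebSocket library's send call as it is captured from the microphone, so the server can start transcribing before the speaker has finished.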

136 stars. No commits in the last 6 months.

Use this if you need to add real-time, open-source speech recognition capabilities to an application or device, especially if you require support for multiple ASR engines or deployment on single-board computers.

Not ideal if you are a non-technical user looking for a ready-to-use speech-to-text application rather than a server to build upon.

voice-assistant audio-transcription IoT-integration real-time-audio-processing application-development

Stale (6 months). No package. No dependents.

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 18 / 25


Stars: 136

Language: Python

License: MIT

Higher-rated alternatives

shibing624/parrots

Automatic Speech Recognition (ASR) and Text-To-Speech (TTS) engine. Chinese and English speech recognition, multi-speaker speech synthesis, multilingual support, high accuracy.

altunenes/parakeet-rs

very fast speech-to-text, diarization, streaming (even in CPU) with NVIDIA Parakeet in Rust

MainRo/deepspeech-server

A testing server for a speech to text service based on coqui.ai

thewh1teagle/pyannote-rs

pyannote audio diarization in rust

PaddlePaddle/Parakeet

PAddle PARAllel text-to-speech toolKIT (supporting Tacotron2, Transformer TTS,...
