rtk-ai/vox
A universal AI toolkit for high-performance Speech-to-Text (STT) and Text-to-Speech (TTS) processing, designed for low-latency and easy model integration.
This tool helps you quickly convert written text into spoken words (Text-to-Speech) or transcribe spoken words into text (Speech-to-Text). You provide text or audio, and it outputs spoken audio or transcribed text. It's designed for anyone who needs to integrate voice capabilities into their daily workflows, especially those who use AI assistants or need customizable speech for various applications.
Use this if you need a flexible way to generate speech from text or transcribe speech, especially if you're integrating with AI assistants or require customizable voice options like cloning.
Not ideal if you need a web-based, cloud-managed solution or a simple, single-purpose speech utility without customization options.
Stars
36
Forks
1
Language
Rust
License
—
Category
Last pushed
Mar 07, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/rtk-ai/vox"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
alphacep/vosk-api
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
huggingface/speech-to-speech
Build local voice agents with open-source models
linto-ai/WebVoiceSDK
Buildings block for voice-enabled applications in the browser
Picovoice/speech-to-text-benchmark
speech to text benchmark framework
vox-serve/vox-serve
A Streaming-Native Serving Engine for TTS/STS Models