rtk-ai/vox

A universal AI toolkit for high-performance Speech-to-Text (STT) and Text-to-Speech (TTS) processing, designed for low-latency and easy model integration.

/ 100

Emerging

This tool helps you quickly convert written text into spoken words (Text-to-Speech) or transcribe spoken words into text (Speech-to-Text). You provide text or audio, and it outputs spoken audio or transcribed text. It's designed for anyone who needs to integrate voice capabilities into their daily workflows, especially those who use AI assistants or need customizable speech for various applications.

Use this if you need a flexible way to generate speech from text or transcribe speech, especially if you're integrating with AI assistants or require customizable voice options like cloning.

Not ideal if you need a web-based, cloud-managed solution or a simple, single-purpose speech utility without customization options.

AI assistant voice command audio production content creation accessibility

No Package No Dependents

Maintenance 10 / 25

Adoption 7 / 25

Maturity 11 / 25

Community 3 / 25

How are scores calculated?

Stars

Forks

Language

Rust

License

—

Higher-rated alternatives

alphacep/vosk-api

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

huggingface/speech-to-speech

Build local voice agents with open-source models

linto-ai/WebVoiceSDK

Buildings block for voice-enabled applications in the browser

Picovoice/speech-to-text-benchmark

speech to text benchmark framework

vox-serve/vox-serve

A Streaming-Native Serving Engine for TTS/STS Models

Explore Voice AI Tools

All categories Trending Voice AI directory Insights