cool-japan/voirs
VoiRS is a cutting-edge Text-to-Speech (TTS), voice recognition, and sound framework that unifies high-performance crates from the cool-japan ecosystem.
VoiRS helps you create natural-sounding spoken audio from written text. You provide text, and it generates high-quality audio files in various voices and languages, allowing for real-time speech synthesis. This is ideal for content creators, educators, accessibility specialists, or anyone needing to convert written material into spoken word.
Use this if you need to transform text into human-like speech with high naturalness and speed, supporting multiple languages and flexible integration into your projects.
Not ideal if you require a large library of pre-trained voices for immediate use without any setup or customization, as the pre-trained model selection is currently limited.
Stars: 23
Forks: 3
Language: Rust
License: —
Category:
Last pushed: Mar 06, 2026
Monthly downloads: 12
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/cool-japan/voirs"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
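The same request can be made from a script. The sketch below is a minimal Python version of the curl call above, using only the standard library. It assumes the path follows the pattern category/owner/repo, which is inferred from the single URL shown here, and that the response body is JSON; neither is confirmed by this page. The names `quality_url` and `fetch_quality` are hypothetical helpers, not part of any published client, and API key handling is left out because the page does not say how a key is passed.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example; the segment layout after it
# (category/owner/repo) is an assumption inferred from that one URL.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL, percent-encoding each path segment."""
    segments = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(segments)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Send a GET request and decode the body as JSON (assumed format)."""
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_quality("voice-ai", "cool-japan", "voirs")` issues the same request as the curl command. With the keyless limit of 100 requests/day, it is worth caching the result rather than fetching on every run.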
Higher-rated alternatives
alphacep/vosk-api
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
huggingface/speech-to-speech
Build local voice agents with open-source models
linto-ai/WebVoiceSDK
Building blocks for voice-enabled applications in the browser
Picovoice/speech-to-text-benchmark
Speech-to-text benchmark framework
vox-serve/vox-serve
A Streaming-Native Serving Engine for TTS/STS Models