cool-japan/voirs

VoiRS is a cutting-edge Text-to-Speech (TTS), Voice Recognition, Sound framework that unifies high-performance crates from the cool-japan ecosystem

/ 100

Emerging

VoiRS helps you create natural-sounding spoken audio from written text. You provide text, and it generates high-quality audio files in various voices and languages, allowing for real-time speech synthesis. This is ideal for content creators, educators, accessibility specialists, or anyone needing to convert written material into spoken word.

Use this if you need to transform text into human-like speech with high naturalness and speed, supporting multiple languages and flexible integration into your projects.

Not ideal if you require a large library of pre-trained voices for immediate use without any setup or customization, as the pre-trained model selection is currently limited.

audio-content-creation speech-synthesis voice-generation e-learning-development accessibility-tools

No Package No Dependents

Maintenance 10 / 25

Adoption 9 / 25

Maturity 15 / 25

Community 11 / 25

How are scores calculated?

Stars

Forks

Language

Rust

License

—

Higher-rated alternatives

alphacep/vosk-api

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

huggingface/speech-to-speech

Build local voice agents with open-source models

linto-ai/WebVoiceSDK

Buildings block for voice-enabled applications in the browser

Picovoice/speech-to-text-benchmark

speech to text benchmark framework

vox-serve/vox-serve

A Streaming-Native Serving Engine for TTS/STS Models

Explore Voice AI Tools

All categories Trending Voice AI directory Insights