mush42/sonata
A cross-platform inference engine for neural TTS models.
This project helps integrate advanced text-to-speech capabilities into applications, allowing them to speak text aloud using neural models. It takes written text as input and produces natural-sounding spoken audio. This is primarily for developers building applications that require high-quality synthesized speech, such as virtual assistants, accessibility tools, or interactive voice response systems.
No commits in the last 6 months.
Use this if you are a developer looking to add a reliable, cross-platform neural text-to-speech engine to your software, especially if you need control over speech prosody and performance.
Not ideal if you are an end-user simply looking for a ready-to-use text-to-speech application; this project is a developer tool for building such applications.
Stars
73
Forks
15
Language
Rust
License
MIT
Category
Last pushed
Nov 25, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/mush42/sonata"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
jpreprocess/jpreprocess
Japanese text preprocessor for Text-to-Speech applications (OpenJTalk rewrite in rust language)
jpreprocess/jbonsai
Voice synthesis library for Text-to-Speech applications (Currently HTS Engine rewrite in Rust language)
CodersCreative/natural-tts
A rust crate for easily implementing Text-To-Speech into your rust programs.
isomoes/blivedm_rs
一个功能强大的 Bilibili 直播间弹幕 WebSocket 客户端 Rust 库,支持实时弹幕监控、文字转语音(TTS)和浏览器 Cookie 自动检测。A powerful...
thewh1teagle/piper-rs
Use piper TTS models in Rust