keonlee9420/STYLER
Official repository of STYLER: Style Factor Modeling with Rapidity and Robustness via Speech Decomposition for Expressive and Controllable Neural Text to Speech, INTERSPEECH 2021
This tool helps content creators and voice artists generate natural-sounding speech from text with fine-grained control over vocal style and expression. You provide written text and a reference audio clip, and it produces audio that speaks your new text in the style, emotion, and speaker characteristics of the reference. It's designed for anyone who needs to quickly create expressive voiceovers, audiobooks, or synthetic speech with specific vocal nuances.
160 stars. No commits in the last 6 months.
Use this if you need to generate high-quality, expressive voiceovers from text while maintaining a consistent vocal style or adapting the style from an existing audio sample, even when that sample is noisy.
Not ideal if you just want a simple text-to-speech tool and don't need advanced control over vocal style or noise handling.
Stars: 160
Forks: 31
Language: Python
License: MIT
Category:
Last pushed: Jun 05, 2025
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/keonlee9420/STYLER"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
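The same request can be made from a script. The sketch below uses only the Python standard library and makes two assumptions that the page does not confirm: that the URL path follows a category/owner/repo pattern (inferred from the single example URL above), and that the response body is JSON. The helper names `build_quality_url` and `fetch_quality` are hypothetical and exist only for this illustration.

```python
import json
import urllib.request

# Base path taken from the curl example above.
API_BASE = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL for one repository.

    Assumption: the path is <category>/<owner>/<repo>, inferred from
    the one example URL on this page.
    """
    return f"{API_BASE}/{category}/{owner}/{repo}"


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Send a GET request and parse the body.

    Assumption: the endpoint returns JSON; the response schema is not
    documented on this page, so the parsed value is returned as-is.
    """
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


if __name__ == "__main__":
    # Requires network access; counts against the 100 requests/day limit.
    data = fetch_quality("voice-ai", "keonlee9420", "STYLER")
    print(json.dumps(data, indent=2))
```

Keeping URL construction separate from the network call makes it easy to check the request target without spending any of the daily quota.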
Higher-rated alternatives
TensorSpeech/TensorFlowTTS
TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for...
lucasnewman/nanospeech
A simple, hackable text-to-speech system in PyTorch and MLX
Tomiinek/Multilingual_Text_to_Speech
An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing,...
jxzhanggg/nonparaSeq2seqVC_code
Implementation code of non-parallel sequence-to-sequence VC
rishikksh20/FastSpeech2
PyTorch Implementation of FastSpeech 2 : Fast and High-Quality End-to-End Text to Speech