keonlee9420/STYLER

Official repository of STYLER: Style Factor Modeling with Rapidity and Robustness via Speech Decomposition for Expressive and Controllable Neural Text to Speech, INTERSPEECH 2021

Quality score: 48 / 100 (Emerging)

This tool helps content creators and voice artists generate natural-sounding speech from text with fine-grained control over vocal style and expression. You provide written text and a reference audio clip, and it produces audio that mimics the style, emotion, and speaker characteristics of the reference, but speaking your new text. It's designed for anyone who needs to quickly create expressive voiceovers, audiobooks, or synthetic speech with specific vocal nuances.

160 stars. No commits in the last 6 months.

Use this if you need to generate high-quality, expressive voiceovers from text while maintaining a consistent vocal style or adapting the style from an existing audio sample, even in noisy conditions.

Not ideal if you're looking for a simple text-to-speech tool without needing advanced control over vocal style or noise handling.

Voice Synthesis, Audio Production, Speech Generation, Content Creation, Media Production
Stale 6m, No Package, No Dependents
Maintenance 2 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 20 / 25

Stars: 160
Forks: 31
Language: Python
License: MIT
Last pushed: Jun 05, 2025
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/keonlee9420/STYLER"

Open to everyone: 100 requests/day, no key needed. A free key raises the limit to 1,000/day.
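
The same request can be made from Python. The following is a minimal sketch using only the standard library. It assumes the endpoint path follows the pattern `/quality/<category>/<owner>/<repo>` (inferred from the single URL shown above) and that the response body is JSON; neither is confirmed by this page. The helper names `quality_url` and `fetch_quality` are hypothetical, and API-key handling is left out because the page does not say how a key is passed.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example; the rest of the path layout is an assumption.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL for one repository (hypothetical helper)."""
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the endpoint and decode the body, assuming it is JSON."""
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.load(response)


if __name__ == "__main__":
    # Performs a real network request; counts against the daily limit.
    data = fetch_quality("voice-ai", "keonlee9420", "STYLER")
    print(json.dumps(data, indent=2))
```

Since the response schema is not documented here, the sketch prints the whole decoded payload instead of picking out individual fields.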