keonlee9420/STYLER

Official repository of STYLER: Style Factor Modeling with Rapidity and Robustness via Speech Decomposition for Expressive and Controllable Neural Text to Speech, INTERSPEECH 2021

/ 100

Emerging

This tool helps content creators and voice artists generate natural-sounding speech from text with fine-grained control over vocal style and expression. You provide written text and a reference audio clip, and it produces audio that mimics the style, emotion, and speaker characteristics of the reference, but speaking your new text. It's designed for anyone who needs to quickly create expressive voiceovers, audiobooks, or synthetic speech with specific vocal nuances.

160 stars. No commits in the last 6 months.

Use this if you need to generate high-quality, expressive voiceovers from text while maintaining a consistent vocal style or adapting the style from an existing audio sample, even in noisy conditions.

Not ideal if you're looking for a simple text-to-speech tool without needing advanced control over vocal style or noise handling.

Voice Synthesis Audio Production Speech Generation Content Creation Media Production

Stale 6m No Package No Dependents

Maintenance 2 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 20 / 25

How are scores calculated?

Stars

160

Forks

Language

Python

License

MIT

Higher-rated alternatives

TensorSpeech/TensorFlowTTS

:stuck_out_tongue_closed_eyes: TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for...

lucasnewman/nanospeech

A simple, hackable text-to-speech system in PyTorch and MLX

Tomiinek/Multilingual_Text_to_Speech

An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing,...

jxzhanggg/nonparaSeq2seqVC_code

Implementation code of non-parallel sequence-to-sequence VC

rishikksh20/FastSpeech2

PyTorch Implementation of FastSpeech 2 : Fast and High-Quality End-to-End Text to Speech

Explore Voice AI Tools

All categories Trending Voice AI directory Insights