rishikksh20/vae_tacotron2

VAE Tacotron 2, an alternative to GST Tacotron

Emerging

This project helps create realistic, customizable synthesized speech from text. You provide text and an audio clip of a desired voice style, and it generates an audio file speaking the text in that style. It is designed for researchers and practitioners working on advanced speech synthesis models.

No commits in the last 6 months.

Use this if you are experimenting with speech synthesis, specifically trying to control or transfer vocal style using text-to-speech models.

Not ideal if you need an out-of-the-box, production-ready speech synthesis solution with guaranteed high-quality style transfer.

speech-synthesis text-to-speech voice-generation audio-research neural-networks

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 9 / 25

Maturity 16 / 25

Community 20 / 25

Language: Python

License: MIT

Higher-rated alternatives

bshall/Tacotron

A PyTorch implementation of Location-Relative Attention Mechanisms For Robust Long-Form Speech Synthesis

Kyubyong/dc_tts

A TensorFlow Implementation of DC-TTS: yet another text-to-speech model

DemisEom/SpecAugment

An implementation of SpecAugment with TensorFlow & PyTorch, introduced by Google Brain

Rayhane-mamah/Tacotron-2

DeepMind's Tacotron-2 Tensorflow implementation

Kyubyong/tacotron

A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model
