rishikksh20/vae_tacotron2
VAE Tacotron 2, an alternative of GST Tacotron
This project helps create realistic, customizable synthesized speech from text. You provide text and an audio clip of a desired voice style, and it generates an audio file speaking the text in that style. It is designed for researchers and practitioners working on advanced speech synthesis models.
No commits in the last 6 months.
Use this if you are experimenting with speech synthesis, specifically trying to control or transfer vocal style using text-to-speech models.
Not ideal if you need an out-of-the-box, production-ready speech synthesis solution with guaranteed high-quality style transfer.
Stars
90
Forks
29
Language
Python
License
MIT
Category
Last pushed
Jul 06, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/rishikksh20/vae_tacotron2"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Compare
Higher-rated alternatives
bshall/Tacotron
A PyTorch implementation of Location-Relative Attention Mechanisms For Robust Long-Form Speech Synthesis
Kyubyong/dc_tts
A TensorFlow Implementation of DC-TTS: yet another text-to-speech model
DemisEom/SpecAugment
A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain
Rayhane-mamah/Tacotron-2
DeepMind's Tacotron-2 Tensorflow implementation
Kyubyong/tacotron
A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model