rishikksh20/gmvae_tacotron
Gaussian Mixture VAE Tacotron
This project helps create realistic, synthesized speech from text, letting you generate custom voice outputs. You provide text and, optionally, a reference audio sample, and it produces an audio file of that text spoken aloud. It's designed for researchers or developers experimenting with advanced text-to-speech models and voice generation.
No commits in the last 6 months.
Use this if you are a researcher or developer who wants to experiment with or implement an advanced text-to-speech system that can synthesize custom sentences from text inputs.
Not ideal if you need a production-ready, highly controllable speech synthesis solution with guaranteed style transfer capabilities right out of the box, as this is an experimental implementation.
Stars: 54
Forks: 11
Language: Python
License: MIT
Category
Last pushed: Jul 06, 2023
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/rishikksh20/gmvae_tacotron"
Open to everyone: 100 requests/day, no key needed. A free key raises the limit to 1,000 requests/day.
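The same request can be sketched in Python using only the standard library. This is a minimal sketch, not an official client: the helper names `build_quality_url` and `fetch_quality` are hypothetical, the `category/owner/repo` path layout is inferred from the single URL shown above, and the assumption that the endpoint returns JSON is unverified.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL (hypothetical helper).

    Assumes the path layout is <category>/<owner>/<repo>, as inferred
    from the one example URL; other layouts may exist.
    """
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return "/".join([BASE_URL, *parts])


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the record for one repository (hypothetical helper).

    Assumes the response body is JSON; that is not confirmed by the page.
    Unauthenticated calls count against the 100 requests/day limit.
    """
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


if __name__ == "__main__":
    # Prints the URL only; call fetch_quality(...) to hit the network.
    print(build_quality_url("voice-ai", "rishikksh20", "gmvae_tacotron"))
```

Keeping URL construction separate from the network call makes it easy to check the request target without spending any of the daily quota.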
Higher-rated alternatives
bshall/Tacotron
A PyTorch implementation of Location-Relative Attention Mechanisms For Robust Long-Form Speech Synthesis
Kyubyong/dc_tts
A TensorFlow Implementation of DC-TTS: yet another text-to-speech model
DemisEom/SpecAugment
An implementation of SpecAugment with TensorFlow & PyTorch, introduced by Google Brain
Rayhane-mamah/Tacotron-2
DeepMind's Tacotron-2 Tensorflow implementation
Kyubyong/tacotron
A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model