kaituoxu/Tacotron2
A PyTorch implementation of Tacotron2, an end-to-end text-to-speech(TTS) system described in "Natural TTS Synthesis By Conditioning Wavenet On Mel Spectrogram Predictions".
This tool helps researchers and voice synthesis engineers convert written text into natural-sounding speech. You provide text input, and it generates corresponding audio files. It is primarily used by those working on advanced speech synthesis projects and experimenting with state-of-the-art neural network architectures.
No commits in the last 6 months.
Use this if you need to generate high-quality, human-like speech from text and are comfortable working with a command-line interface and training deep learning models.
Not ideal if you're looking for a simple, off-the-shelf text-to-speech application without any deep learning model training or customization.
Stars
52
Forks
14
Language
Python
License
—
Category
Last pushed
Jan 30, 2019
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/kaituoxu/Tacotron2"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
bshall/Tacotron
A PyTorch implementation of Location-Relative Attention Mechanisms For Robust Long-Form Speech Synthesis
Kyubyong/dc_tts
A TensorFlow Implementation of DC-TTS: yet another text-to-speech model
DemisEom/SpecAugment
A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain
Rayhane-mamah/Tacotron-2
DeepMind's Tacotron-2 Tensorflow implementation
Kyubyong/tacotron
A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model