BogiHsu/Tacotron2-PyTorch
Yet another PyTorch implementation of Tacotron 2 with reduction factor and faster training speed.
This is a tool for developers who want to create high-quality, natural-sounding synthetic speech from text. It takes written text as input and generates corresponding audio files (.wav). The primary users are machine learning engineers or researchers working on text-to-speech applications.
148 stars. No commits in the last 6 months.
Use this if you are a machine learning engineer or researcher developing custom text-to-speech models and need a flexible, performant PyTorch implementation of Tacotron 2.
Not ideal if you are a non-developer seeking an out-of-the-box text-to-speech service or simple audio generation from text without coding.
Stars
148
Forks
37
Language
Python
License
MIT
Category
Last pushed
Apr 12, 2022
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/BogiHsu/Tacotron2-PyTorch"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Compare
Higher-rated alternatives
bshall/Tacotron
A PyTorch implementation of Location-Relative Attention Mechanisms For Robust Long-Form Speech Synthesis
Kyubyong/dc_tts
A TensorFlow Implementation of DC-TTS: yet another text-to-speech model
DemisEom/SpecAugment
A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain
Rayhane-mamah/Tacotron-2
DeepMind's Tacotron-2 Tensorflow implementation
Kyubyong/tacotron
A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model