Kyubyong/tacotron
A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model
This project helps you transform written text into natural-sounding spoken audio. You input text, and it produces an audio file of that text being read aloud, much like an audiobook. This is ideal for content creators, educators, or anyone needing to generate speech from text for various applications.
1,833 stars. No commits in the last 6 months.
Use this if you need to create realistic spoken audio from text, especially for longer passages or datasets, and want control over the voice generation process.
Not ideal if you need to synthesize speech in real-time for interactive applications or require extremely fine-grained emotional control over the generated voice.
Stars: 1,833
Forks: 431
Language: Python
License: Apache-2.0
Category:
Last pushed: Jan 17, 2022
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/Kyubyong/tacotron"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
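The same request can be made from Python. The following is a minimal sketch using only the standard library. It assumes that the endpoint follows the `category/owner/repo` path pattern seen in the curl example above and that the response body is JSON; neither is confirmed by this page. The helper names `quality_url` and `fetch_quality` are hypothetical.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path copied from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL.

    Assumption: the path is <category>/<owner>/<repo>, inferred from the
    single curl example on this page.
    """
    parts = [quote(p, safe="") for p in (category, owner, repo)]
    return f"{BASE_URL}/{'/'.join(parts)}"


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Send a GET request without an API key and parse the body.

    Assumption: the response is JSON; its schema is not documented here,
    so the parsed object is returned unchanged.
    """
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


if __name__ == "__main__":
    # Only prints the URL; call fetch_quality(...) to hit the network.
    print(quality_url("voice-ai", "Kyubyong", "tacotron"))
```

Calling `fetch_quality("voice-ai", "Kyubyong", "tacotron")` performs the actual request and counts against the daily request limit.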
Related tools
bshall/Tacotron
A PyTorch implementation of Location-Relative Attention Mechanisms For Robust Long-Form Speech Synthesis
Kyubyong/dc_tts
A TensorFlow Implementation of DC-TTS: yet another text-to-speech model
DemisEom/SpecAugment
An Implementation of SpecAugment with TensorFlow & PyTorch, introduced by Google Brain
Rayhane-mamah/Tacotron-2
DeepMind's Tacotron-2 TensorFlow implementation
vlomme/Multi-Tacotron-Voice-Cloning
Phoneme multilingual(Russian-English) voice cloning based on