Rayhane-mamah/Tacotron-2
DeepMind's Tacotron-2 Tensorflow implementation
This project helps generate natural-sounding speech from written text. You provide text as input, and it produces an audio file of that text spoken aloud. It's designed for researchers and developers working on advanced text-to-speech systems who need to experiment with and build upon state-of-the-art neural network architectures.
2,317 stars. No commits in the last 6 months.
Use this if you are a researcher or advanced developer looking to implement, train, and fine-tune a Tacotron-2 deep learning model for text-to-speech synthesis using specific speech datasets like LJSpeech or M-AILABS.
Not ideal if you need a plug-and-play solution for immediate text-to-speech conversion or if you're working with datasets that aren't similar to LJSpeech without custom preprocessing.
Stars
2,317
Forks
904
Language
Python
License
MIT
Category
Last pushed
Jul 06, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/Rayhane-mamah/Tacotron-2"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Compare
Related tools
bshall/Tacotron
A PyTorch implementation of Location-Relative Attention Mechanisms For Robust Long-Form Speech Synthesis
Kyubyong/dc_tts
A TensorFlow Implementation of DC-TTS: yet another text-to-speech model
DemisEom/SpecAugment
A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain
Kyubyong/tacotron
A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model
vlomme/Multi-Tacotron-Voice-Cloning
Phoneme multilingual(Russian-English) voice cloning based on