Rayhane-mamah/Tacotron-2

DeepMind's Tacotron-2 TensorFlow implementation


Established

This project helps generate natural-sounding speech from written text. You provide text as input, and it produces an audio file of that text spoken aloud. It's designed for researchers and developers working on advanced text-to-speech systems who need to experiment with and build upon state-of-the-art neural network architectures.

2,317 stars. No commits in the last 6 months.

Use this if you are a researcher or advanced developer looking to implement, train, and fine-tune a Tacotron-2 deep learning model for text-to-speech synthesis using specific speech datasets like LJSpeech or M-AILABS.

Not ideal if you need a plug-and-play solution for immediate text-to-speech conversion, or if your dataset is structured differently from LJSpeech and you are not prepared to write custom preprocessing.
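As a rough illustration of the custom preprocessing mentioned above, the sketch below reads an LJSpeech-style metadata file, where each row is pipe-delimited as `id|transcript|normalized transcript`, and pairs each clip with its text. The function name `read_ljspeech_metadata`, the `wavs` directory default, and the sample rows are hypothetical and are not taken from the Tacotron-2 repository; treat this as a starting point under those assumptions, not as the project's own preprocessing code.

```python
import csv
import io


def read_ljspeech_metadata(lines, wav_dir="wavs"):
    """Parse LJSpeech-style rows (id|transcript|normalized transcript).

    Hypothetical helper: returns a list of (wav_path, text) pairs.
    """
    entries = []
    # QUOTE_NONE keeps quotation marks inside transcripts as literal text.
    reader = csv.reader(lines, delimiter="|", quoting=csv.QUOTE_NONE)
    for row in reader:
        if len(row) < 2:
            continue  # skip blank or malformed rows
        clip_id, raw_text = row[0], row[1]
        # Prefer the normalized transcript; fall back to the raw one.
        text = row[2] if len(row) > 2 and row[2] else raw_text
        entries.append((f"{wav_dir}/{clip_id}.wav", text))
    return entries


# Made-up sample rows standing in for a real metadata.csv file.
sample = io.StringIO(
    "clip_0001|Chapter 2 begins.|Chapter two begins.\n"
    "clip_0002|No digits here.\n"
)
pairs = read_ljspeech_metadata(sample)
```

A dataset with a different layout would need its own reader that produces the same kind of (audio path, text) pairs before training.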

speech-synthesis voice-generation natural-language-processing audio-engineering deep-learning-research

Stale 6m · No Package · No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 25 / 25


Stars: 2,317

Forks: 904

Language: Python

License: MIT


Related tools

bshall/Tacotron

A PyTorch implementation of Location-Relative Attention Mechanisms For Robust Long-Form Speech Synthesis

Kyubyong/dc_tts

A TensorFlow Implementation of DC-TTS: yet another text-to-speech model

DemisEom/SpecAugment

An implementation of SpecAugment with TensorFlow & PyTorch, introduced by Google Brain

Kyubyong/tacotron

A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model

vlomme/Multi-Tacotron-Voice-Cloning

Phoneme multilingual (Russian-English) voice cloning based on
