Kyubyong/tacotron

A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model

/ 100

Established

This project helps you transform written text into natural-sounding spoken audio. You input text, and it produces an audio file of that text being read aloud, much like an audiobook. This is ideal for content creators, educators, or anyone needing to generate speech from text for various applications.

1,833 stars. No commits in the last 6 months.

Use this if you need to create realistic spoken audio from text, especially for longer passages or datasets, and want control over the voice generation process.

Not ideal if you need to synthesize speech in real-time for interactive applications or require extremely fine-grained emotional control over the generated voice.

text-to-speech audiobook-creation content-localization narration e-learning

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 25 / 25

How are scores calculated?

Stars

1,833

Forks

431

Language

Python

License

Apache-2.0

Compare

tacotron and Tacotron tacotron and dc_tts tacotron and Tacotron-2 tacotron and GST-Tacotron tacotron and tacotron_asr tacotron and Tacotron-pytorch

Related tools

bshall/Tacotron

A PyTorch implementation of Location-Relative Attention Mechanisms For Robust Long-Form Speech Synthesis

Kyubyong/dc_tts

A TensorFlow Implementation of DC-TTS: yet another text-to-speech model

DemisEom/SpecAugment

A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain

Rayhane-mamah/Tacotron-2

DeepMind's Tacotron-2 Tensorflow implementation

vlomme/Multi-Tacotron-Voice-Cloning

Phoneme multilingual(Russian-English) voice cloning based on

Explore Voice AI Tools

All categories Trending Voice AI directory Insights