Kyubyong/tacotron

A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model

Score: 51 / 100 (Established)

This project helps you transform written text into natural-sounding spoken audio. You input text, and it produces an audio file of that text being read aloud, much like an audiobook. This is ideal for content creators, educators, or anyone needing to generate speech from text for various applications.

1,833 stars. No commits in the last 6 months.

Use this if you need to create realistic spoken audio from text, especially for longer passages or datasets, and want control over the voice generation process.

Not ideal if you need to synthesize speech in real time for interactive applications or require fine-grained emotional control over the generated voice.

Tags: text-to-speech, audiobook-creation, content-localization, narration, e-learning
Status: Stale (6 months), no package, no dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 25 / 25


Stars: 1,833
Forks: 431
Language: Python
License: Apache-2.0
Last pushed: Jan 17, 2022
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/Kyubyong/tacotron"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
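
The curl request above could also be made from Python. This is a minimal sketch using only the standard library. The helper names `build_quality_url` and `fetch_quality` are hypothetical, and the code assumes the endpoint returns a JSON body, which this page does not confirm. No API key handling is shown because the page does not say how a key is passed.

```python
import json
import urllib.parse
import urllib.request

# Base path taken from the curl example on this page.
API_BASE = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Build the quality-endpoint URL for one repository (hypothetical helper)."""
    # Percent-encode each path segment so unusual characters cannot break the path.
    segments = [urllib.parse.quote(part, safe="") for part in (category, owner, repo)]
    return API_BASE + "/" + "/".join(segments)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the quality data and parse it (assumption: the response is JSON)."""
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_quality("voice-ai", "Kyubyong", "tacotron")` would issue the same request as the curl command. Each call counts against the daily request limit, so cache the result if you poll repeatedly.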