Rayhane-mamah/Tacotron-2

DeepMind's Tacotron-2 Tensorflow implementation

51
/ 100
Established

This project helps generate natural-sounding speech from written text. You provide text as input, and it produces an audio file of that text spoken aloud. It's designed for researchers and developers working on advanced text-to-speech systems who need to experiment with and build upon state-of-the-art neural network architectures.

2,317 stars. No commits in the last 6 months.

Use this if you are a researcher or advanced developer looking to implement, train, and fine-tune a Tacotron-2 deep learning model for text-to-speech synthesis using specific speech datasets like LJSpeech or M-AILABS.

Not ideal if you need a plug-and-play solution for immediate text-to-speech conversion or if you're working with datasets that aren't similar to LJSpeech without custom preprocessing.

speech-synthesis voice-generation natural-language-processing audio-engineering deep-learning-research
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 25 / 25

How are scores calculated?

Stars

2,317

Forks

904

Language

Python

License

MIT

Last pushed

Jul 06, 2023

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/Rayhane-mamah/Tacotron-2"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.