Yangyangii/AdvDCTTS
Implementation of DCTTS with Adversarial Training
This project helps speech synthesis researchers create realistic, high-quality synthetic speech directly from text. You input a written script and a pre-processed speech dataset, and it produces audio files of the text being spoken. It's designed for researchers and engineers working on text-to-speech (TTS) systems who want to experiment with advanced deep learning models.
No commits in the last 6 months.
Use this if you are a speech synthesis researcher looking to train a text-to-speech model using adversarial training techniques to generate sharp, clear spectrograms and natural-sounding speech.
Not ideal if you are a content creator or end-user simply looking to convert text to speech without deep involvement in model training and experimentation.
Stars
12
Forks
3
Language
Python
License
—
Category
Last pushed
Dec 30, 2019
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/Yangyangii/AdvDCTTS"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
bshall/Tacotron
A PyTorch implementation of Location-Relative Attention Mechanisms For Robust Long-Form Speech Synthesis
Kyubyong/dc_tts
A TensorFlow Implementation of DC-TTS: yet another text-to-speech model
DemisEom/SpecAugment
A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain
Rayhane-mamah/Tacotron-2
DeepMind's Tacotron-2 Tensorflow implementation
Kyubyong/tacotron
A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model