ColdFir5/Uberduck-Tacotron2-Multi-Speaker-GSTs-Implementation
Currently this is not working as the repo used to make the notebook work is under-development, This is a notebook from Kaggle I had made that allows user's to make their own AI voices using 16bit PCM, 22050 HZ WAV files on the Neural networks provided by NVIDIA's creation of Tacotron 2 which has been further developed and worked on by the team at Uberduck.ai in order to add other amazing features to it using Tacotron 2 GPT. These AI TTS voices can be used regularly to output synthesised vocals or you can use reference audio in order to let the AI adapt to the flow of the reference audio to output a similar result.
No commits in the last 6 months.
Stars
3
Forks
—
Language
Jupyter Notebook
License
—
Category
Last pushed
Feb 21, 2022
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/ColdFir5/Uberduck-Tacotron2-Multi-Speaker-GSTs-Implementation"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
bshall/Tacotron
A PyTorch implementation of Location-Relative Attention Mechanisms For Robust Long-Form Speech Synthesis
Kyubyong/dc_tts
A TensorFlow Implementation of DC-TTS: yet another text-to-speech model
DemisEom/SpecAugment
A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain
Rayhane-mamah/Tacotron-2
DeepMind's Tacotron-2 Tensorflow implementation
Kyubyong/tacotron
A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model