ivanvovk/compressed-tacotron2-pytorch
Compressed version of Tacotron 2 using Tensor Train + Waveglow.
This project helps reduce the computational resources needed for text-to-speech (TTS) systems. It takes an existing Tacotron 2 and WaveGlow model and compresses it, yielding a smaller model that generates speech faster. This would be used by engineers or researchers working on deploying efficient voice synthesis applications.
No commits in the last 6 months.
Use this if you need to deploy a text-to-speech system that synthesizes audio quickly and uses less memory, particularly for real-time applications or environments with limited resources.
Not ideal if you are looking for an out-of-the-box, pre-trained, highly accurate text-to-speech model without needing to worry about model compression or fine-tuning.
Stars
22
Forks
9
Language
Jupyter Notebook
License
MIT
Category
Last pushed
Dec 26, 2019
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/ivanvovk/compressed-tacotron2-pytorch"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
bshall/Tacotron
A PyTorch implementation of Location-Relative Attention Mechanisms For Robust Long-Form Speech Synthesis
Kyubyong/dc_tts
A TensorFlow Implementation of DC-TTS: yet another text-to-speech model
DemisEom/SpecAugment
A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain
Rayhane-mamah/Tacotron-2
DeepMind's Tacotron-2 Tensorflow implementation
Kyubyong/tacotron
A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model