keonlee9420/VAENAR-TTS
PyTorch Implementation of VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis.
This project helps create natural-sounding spoken audio from written text. You provide text, and it generates an audio file of someone speaking that text. This tool is for content creators, educators, or anyone needing to convert written scripts into voiceovers or spoken content.
No commits in the last 6 months.
Use this if you need to quickly generate speech from English text using a single female speaker's voice.
Not ideal if you require multi-speaker synthesis, highly customized voice characteristics, or voices in languages other than English.
Stars
73
Forks
14
Language
Python
License
MIT
Category
Last pushed
Aug 03, 2021
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/keonlee9420/VAENAR-TTS"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
bshall/Tacotron
A PyTorch implementation of Location-Relative Attention Mechanisms For Robust Long-Form Speech Synthesis
Kyubyong/dc_tts
A TensorFlow Implementation of DC-TTS: yet another text-to-speech model
DemisEom/SpecAugment
A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain
Rayhane-mamah/Tacotron-2
DeepMind's Tacotron-2 Tensorflow implementation
Kyubyong/tacotron
A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model