Wendison/FCL-taco2
Official implementation of FCL-taco2: Fast, Controllable and Lightweight version of Tacotron2 @ ICASSP 2021
This project helps you create high-quality, natural-sounding speech from written text quickly and efficiently. You provide written text, and it generates an audio file of that text being spoken. This is useful for content creators, educators, or businesses needing to convert scripts into speech for voiceovers, audiobooks, or interactive voice systems.
No commits in the last 6 months.
Use this if you need to synthesize speech from text with a focus on speed, resource efficiency, and control over speech characteristics, delivering natural-sounding results.
Not ideal if you are looking for a simple, off-the-shelf text-to-speech solution without needing to perform dataset preparation or model training.
Stars
40
Forks
6
Language
Python
License
MIT
Category
Last pushed
Jul 17, 2021
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/Wendison/FCL-taco2"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
bshall/Tacotron
A PyTorch implementation of Location-Relative Attention Mechanisms For Robust Long-Form Speech Synthesis
Kyubyong/dc_tts
A TensorFlow Implementation of DC-TTS: yet another text-to-speech model
DemisEom/SpecAugment
A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain
Rayhane-mamah/Tacotron-2
DeepMind's Tacotron-2 Tensorflow implementation
Kyubyong/tacotron
A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model