caizexin/tf_multispeakerTTS_fc
the Tensorflow version of multi-speaker TTS training with feedback constraint
This project helps create realistic-sounding speech from text, specifically for voices that are already familiar, or for synthesizing speech in multiple distinct voices. It takes text and pre-recorded audio of a speaker's voice as input, and outputs natural-sounding spoken audio in that speaker's voice. Voice artists, audiobook producers, or developers building custom voice assistants would find this useful.
No commits in the last 6 months.
Use this if you need to generate high-quality, natural-sounding speech for specific voices, especially when maintaining a consistent speaker identity across different pieces of text is important.
Not ideal if you're looking for a simple text-to-speech system for a single, generic voice without needing to train on custom speaker data.
Stars
40
Forks
32
Language
Python
License
MIT
Category
Last pushed
Oct 12, 2020
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/caizexin/tf_multispeakerTTS_fc"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
TensorSpeech/TensorFlowTTS
:stuck_out_tongue_closed_eyes: TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for...
lucasnewman/nanospeech
A simple, hackable text-to-speech system in PyTorch and MLX
Tomiinek/Multilingual_Text_to_Speech
An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing,...
keonlee9420/STYLER
Official repository of STYLER: Style Factor Modeling with Rapidity and Robustness via Speech...
jxzhanggg/nonparaSeq2seqVC_code
Implementation code of non-parallel sequence-to-sequence VC