Adibian/ResGrad
Unofficial implementation of ResGrad: Residual Denoising Diffusion Probabilistic Models for Text to Speech
This tool helps create high-quality, natural-sounding speech from written text. You provide text inputs and optionally existing audio samples, and it generates clear, human-like spoken audio. It's designed for researchers and developers working on advanced text-to-speech systems.
No commits in the last 6 months.
Use this if you need to synthesize very high-fidelity speech from text for research or specialized applications.
Not ideal if you're looking for a simple, out-of-the-box text-to-speech solution for everyday use.
Stars: 20
Forks: 4
Language: Python
License: MIT
Category:
Last pushed: Feb 09, 2025
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/Adibian/ResGrad"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
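The same request can be made from Python. This is a minimal sketch using only the standard library. It assumes the endpoint follows the path pattern seen in the curl command above (category, then owner, then repository), which the page does not document. The helper names `quality_url` and `fetch_quality` are hypothetical, and the response is returned as raw text because the page does not describe its format.

```python
from urllib.parse import quote
from urllib.request import urlopen

# Base path taken from the curl command above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL (hypothetical helper).

    Assumes the path layout is <category>/<owner>/<repo>, inferred from
    the single example URL on this page.
    """
    segments = (category, owner, repo)
    # Percent-encode each segment so unusual characters cannot break the path.
    return BASE_URL + "/" + "/".join(quote(s, safe="") for s in segments)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0) -> str:
    """Fetch the raw response body as text (hypothetical helper).

    The response format is not documented on this page, so the body is
    returned undecoded beyond its character set rather than parsed.
    """
    with urlopen(quality_url(category, owner, repo), timeout=timeout) as response:
        charset = response.headers.get_content_charset() or "utf-8"
        return response.read().decode(charset)


# Example (performs a network request and counts against the daily limit):
# print(fetch_quality("voice-ai", "Adibian", "ResGrad"))
```

Keeping URL construction separate from the network call makes it easy to check the address before spending one of the 100 keyless requests per day.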
Higher-rated alternatives
TensorSpeech/TensorFlowTTS
TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for...
lucasnewman/nanospeech
A simple, hackable text-to-speech system in PyTorch and MLX
Tomiinek/Multilingual_Text_to_Speech
An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing,...
keonlee9420/STYLER
Official repository of STYLER: Style Factor Modeling with Rapidity and Robustness via Speech...
jxzhanggg/nonparaSeq2seqVC_code
Implementation code of non-parallel sequence-to-sequence VC