CSTR-Edinburgh/magphase
MagPhase Vocoder: Speech analysis/synthesis system for TTS and related applications.
This system helps improve the quality and naturalness of synthetic speech for text-to-speech (TTS) and voice conversion applications. It takes raw speech recordings as input, extracts detailed sound characteristics, and then uses these to generate more realistic-sounding synthesized voices. This is useful for speech scientists, researchers, and developers working on advanced speech synthesis models.
No commits in the last 6 months.
Use this if you are developing statistical parametric speech synthesis systems and need a robust way to analyze and synthesize speech waveforms with improved sound quality, especially regarding the naturalness of phase spectra.
Not ideal if you are looking for a simple, off-the-shelf TTS application for end-users, or if you require out-of-the-box support for macOS or Python 3 without any modifications.
Stars
80
Forks
31
Language
Python
License
Apache-2.0
Category
Last pushed
Oct 14, 2019
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/CSTR-Edinburgh/magphase"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
TensorSpeech/TensorFlowTTS
:stuck_out_tongue_closed_eyes: TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for...
lucasnewman/nanospeech
A simple, hackable text-to-speech system in PyTorch and MLX
Tomiinek/Multilingual_Text_to_Speech
An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing,...
keonlee9420/STYLER
Official repository of STYLER: Style Factor Modeling with Rapidity and Robustness via Speech...
jxzhanggg/nonparaSeq2seqVC_code
Implementation code of non-parallel sequence-to-sequence VC