hayeong0/Diff-HierVC
Official Pytorch Implementation of "Diff-HierVC: Diffusion-based Hierarchical Voice Conversion with Robust Pitch Generation and Masked Prior for Zero-shot Speaker Adaptation"
This tool helps convert speech from one speaker's voice to another while maintaining the original spoken content. You provide an audio recording of someone speaking and a sample of a target voice, and it generates a new audio recording where the original speech is spoken in the target voice. This is useful for content creators, game developers, or anyone needing to generate speech in various voices without re-recording.
235 stars. No commits in the last 6 months.
Use this if you need to transform spoken audio to sound like a different person, even if you only have a short sample of the target voice.
Not ideal if you need to create speech from text or are looking for highly customized voice modulation beyond speaker style transfer.
Stars
235
Forks
19
Language
Python
License
—
Category
Last pushed
Jul 03, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/diffusion/hayeong0/Diff-HierVC"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
PrunaAI/pruna
Pruna is a model optimization framework built for developers, enabling you to deliver faster,...
bytedance/LatentSync
Taming Stable Diffusion for Lip Sync!
haoheliu/AudioLDM-training-finetuning
AudioLDM training, finetuning, evaluation and inference.
Text-to-Audio/Make-An-Audio
PyTorch Implementation of Make-An-Audio (ICML'23) with a Text-to-Audio Generative Model
teticio/audio-diffusion
Apply diffusion models using the new Hugging Face diffusers package to synthesize music instead...