trinhtuanvubk/Diff-VC
Diffusion Model for Voice Conversion
This project helps you change the voice of an audio recording while keeping the original speech content. You provide an audio file with someone speaking and another audio file with the target voice you want to imitate. The result is the original speech content, but now spoken in the new target voice. This is useful for anyone working with audio content who needs to modify speaker identity, such as content creators, podcasters, or animators.
No commits in the last 6 months.
Use this if you need to transform the speaker's voice in an audio recording to sound like a different person's voice without changing what is being said.
Not ideal if you need to generate entirely new speech from text or translate speech into a different language.
Stars
69
Forks
8
Language
Jupyter Notebook
License
MIT
Category
Last pushed
Mar 14, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/diffusion/trinhtuanvubk/Diff-VC"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
PrunaAI/pruna
Pruna is a model optimization framework built for developers, enabling you to deliver faster,...
bytedance/LatentSync
Taming Stable Diffusion for Lip Sync!
haoheliu/AudioLDM-training-finetuning
AudioLDM training, finetuning, evaluation and inference.
Text-to-Audio/Make-An-Audio
PyTorch Implementation of Make-An-Audio (ICML'23) with a Text-to-Audio Generative Model
teticio/audio-diffusion
Apply diffusion models using the new Hugging Face diffusers package to synthesize music instead...