hayeong0/Diff-HierVC

Official Pytorch Implementation of "Diff-HierVC: Diffusion-based Hierarchical Voice Conversion with Robust Pitch Generation and Masked Prior for Zero-shot Speaker Adaptation"

/ 100

Emerging

This tool helps convert speech from one speaker's voice to another while maintaining the original spoken content. You provide an audio recording of someone speaking and a sample of a target voice, and it generates a new audio recording where the original speech is spoken in the target voice. This is useful for content creators, game developers, or anyone needing to generate speech in various voices without re-recording.

235 stars. No commits in the last 6 months.

Use this if you need to transform spoken audio to sound like a different person, even if you only have a short sample of the target voice.

Not ideal if you need to create speech from text or are looking for highly customized voice modulation beyond speaker style transfer.

voice conversion audio production dubbing content creation speech synthesis

No License Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 8 / 25

Community 13 / 25

How are scores calculated?

Stars

235

Forks

Language

Python

License

—

Higher-rated alternatives

PrunaAI/pruna

Pruna is a model optimization framework built for developers, enabling you to deliver faster,...

bytedance/LatentSync

Taming Stable Diffusion for Lip Sync!

haoheliu/AudioLDM-training-finetuning

AudioLDM training, finetuning, evaluation and inference.

Text-to-Audio/Make-An-Audio

PyTorch Implementation of Make-An-Audio (ICML'23) with a Text-to-Audio Generative Model

teticio/audio-diffusion

Apply diffusion models using the new Hugging Face diffusers package to synthesize music instead...

Explore Diffusion Models

All categories Trending Diffusion directory Insights