hayeong0/DDDM-VC

Official Pytorch Implementation for "DDDM-VC: Decoupled Denoising Diffusion Models with Disentangled Representation and Prior Mixup for Verified Robust Voice Conversion" (AAAI 2024)

/ 100

Emerging

This project helps you change the voice of a spoken audio recording while keeping the original words and meaning intact. You provide an audio file of someone speaking and a target voice, and it generates a new audio file where the original speech is spoken in the target voice. This is useful for anyone working with synthetic speech or audio content creation.

243 stars. No commits in the last 6 months.

Use this if you need to convert speech from one voice to another, for example, to create consistent voiceovers or personalize audio content.

Not ideal if you need to generate speech from text, as this tool requires an existing audio input.

voice-conversion audio-synthesis speech-editing content-creation

No License Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 8 / 25

Community 15 / 25

How are scores calculated?

Stars

243

Forks

Language

Python

License

—

Higher-rated alternatives

huggingface/diffusers

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

bghira/SimpleTuner

A general fine-tuning kit geared toward image/video/audio diffusion models.

mcmonkeyprojects/SwarmUI

SwarmUI (formerly StableSwarmUI), A Modular Stable Diffusion Web-User-Interface, with an...

nateraw/stable-diffusion-videos

Create 🔥 videos with Stable Diffusion by exploring the latent space and morphing between text prompts

TheDesignFounder/DreamLayer

Benchmark diffusion models faster. Automate evals, seeds, and metrics for reproducible results.

Explore Diffusion Models

All categories Trending Diffusion directory Insights