keonlee9420/DiffSinger

PyTorch implementation of DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (focused on DiffSpeech)

44 / 100

Emerging

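This tool helps vocal synthesis artists and audio producers create realistic synthesized voices from text. You input written lyrics or phrases, and it generates audio of a synthesized voice rendering those words. Because this implementation focuses on DiffSpeech, the text-to-speech variant described in the DiffSinger paper, expect speech-style output rather than full singing synthesis. It suits musicians, content creators, and voiceover artists who want to produce distinctive vocal tracks.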

247 stars. No commits in the last 6 months.

Use this if you need to generate high-quality, controllable singing voice audio from text for creative projects.

Not ideal if you're looking for a multi-speaker solution or want to use a vocoder other than HiFi-GAN.

singing-synthesis vocal-production music-creation text-to-speech audio-generation

Stale 6m, No Package, No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 18 / 25


Stars: 247

Forks

Language: Python

License: MIT


Higher-rated alternatives

PrunaAI/pruna

Pruna is a model optimization framework built for developers, enabling you to deliver faster,...

bytedance/LatentSync

Taming Stable Diffusion for Lip Sync!

haoheliu/AudioLDM-training-finetuning

AudioLDM training, finetuning, evaluation and inference.

Text-to-Audio/Make-An-Audio

PyTorch Implementation of Make-An-Audio (ICML'23) with a Text-to-Audio Generative Model

teticio/audio-diffusion

Apply diffusion models using the new Hugging Face diffusers package to synthesize music instead...
