keonlee9420/DiffSinger
PyTorch implementation of DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (focused on DiffSpeech)
This tool helps vocal synthesis artists and audio producers create realistic singing voices from text. You input written lyrics or phrases, and it generates high-quality audio of a synthesized voice singing those words. This is ideal for musicians, content creators, or voiceover artists looking to produce unique vocal tracks.
247 stars. No commits in the last 6 months.
Use this if you need to generate high-quality, controllable singing voice audio from text for creative projects.
Not ideal if you're looking for a multi-speaker solution or want to use a vocoder other than HiFi-GAN.
Stars: 247
Forks: 33
Language: Python
License: MIT
Category:
Last pushed: Feb 03, 2022
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/diffusion/keonlee9420/DiffSinger"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
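The same request can be made from Python. The sketch below rests on two assumptions that the listing does not confirm: that the endpoint returns JSON, and that the URL path follows a category/owner/repo pattern (inferred from the single curl example above). The helper names `build_quality_url` and `fetch_quality` are hypothetical, chosen for illustration only.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path copied from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL (hypothetical helper).

    Assumes the path layout is <category>/<owner>/<repo>, which is
    inferred from one example URL and may not hold in general.
    """
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the listing data without an API key (hypothetical helper).

    Assumes the response body is JSON; the schema is not documented
    in the listing, so the parsed object is returned as-is.
    """
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


# Example usage (performs a network request, so it is left commented out):
# data = fetch_quality("diffusion", "keonlee9420", "DiffSinger")
# print(json.dumps(data, indent=2))
```

Keyless access is limited to 100 requests/day, so cache the result locally rather than calling `fetch_quality` in a loop.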
Compare
Higher-rated alternatives
PrunaAI/pruna
Pruna is a model optimization framework built for developers, enabling you to deliver faster,...
bytedance/LatentSync
Taming Stable Diffusion for Lip Sync!
haoheliu/AudioLDM-training-finetuning
AudioLDM training, finetuning, evaluation and inference.
Text-to-Audio/Make-An-Audio
PyTorch Implementation of Make-An-Audio (ICML'23) with a Text-to-Audio Generative Model
teticio/audio-diffusion
Apply diffusion models using the new Hugging Face diffusers package to synthesize music instead...