DiffSinger and DiffGAN-TTS

These are ecosystem siblings—both are PyTorch implementations of diffusion-based speech synthesis architectures (DiffSinger for singing and DiffGAN-TTS for general TTS) from the same author that share similar technical foundations but target different synthesis tasks.

DiffSinger
44
Emerging
DiffGAN-TTS
43
Emerging
Maintenance 0/25
Adoption 10/25
Maturity 16/25
Community 18/25
Maintenance 0/25
Adoption 10/25
Maturity 16/25
Community 17/25
Stars: 247
Forks: 33
Downloads:
Commits (30d): 0
Language: Python
License: MIT
Stars: 347
Forks: 44
Downloads:
Commits (30d): 0
Language: Python
License: MIT
Stale 6m No Package No Dependents
Stale 6m No Package No Dependents

About DiffSinger

keonlee9420/DiffSinger

PyTorch implementation of DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (focused on DiffSpeech)

This tool helps vocal synthesis artists and audio producers create realistic singing voices from text. You input written lyrics or phrases, and it generates high-quality audio of a synthesized voice singing those words. This is ideal for musicians, content creators, or voiceover artists looking to produce unique vocal tracks.

singing-synthesis vocal-production music-creation text-to-speech audio-generation

About DiffGAN-TTS

keonlee9420/DiffGAN-TTS

PyTorch Implementation of DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs

DiffGAN-TTS helps creators, educators, and content producers transform written text into high-quality, natural-sounding spoken audio. You input text, and it generates audio files of a single speaker or multiple speakers, with options to control elements like pitch and speaking rate. This is ideal for anyone who needs to quickly create voiceovers or spoken content from text.

text-to-speech voice-generation audiobook-creation elearning-content content-localization

Scores updated daily from GitHub, PyPI, and npm data. How scores work