candlewill/AiVoice

Deep CNN networks for Speech Synthesis

/ 100

Emerging

This project helps developers create a single-speaker text-to-speech system. You provide text sentences and audio recordings from a single speaker, and the system learns to convert new text into spoken audio in that speaker's voice. This is primarily used by engineers or researchers working on speech synthesis technology.

No commits in the last 6 months.

Use this if you are a developer or researcher looking to experiment with or build a text-to-speech system for a single speaker.

Not ideal if you need an out-of-the-box solution for generating speech without any programming or machine learning expertise.

speech-synthesis text-to-speech audio-generation deep-learning voice-cloning

No License Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 8 / 25

Maturity 8 / 25

Community 18 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

—

Higher-rated alternatives

kan-bayashi/ParallelWaveGAN

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch

fatchord/WaveRNN

WaveRNN Vocoder + TTS

shangeth/wavencoder

WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation,...

rishikksh20/iSTFTNet-pytorch

iSTFTNet : Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier...

seungwonpark/melgan

MelGAN vocoder (compatible with NVIDIA/tacotron2)

Explore Voice AI Tools

All categories Trending Voice AI directory Insights