keonlee9420/PortaSpeech

PyTorch Implementation of PortaSpeech: Portable and High-Quality Generative Text-to-Speech

/ 100

Emerging

This tool helps you turn written text into natural-sounding speech for a single speaker. You provide plain text, and it generates an audio file of that text being spoken. Content creators, educators, or anyone needing to generate voiceovers or audio content from scripts can use this.

341 stars. No commits in the last 6 months.

Use this if you need to quickly generate high-quality, single-speaker audio from text, especially when you want control over the speaking rate.

Not ideal if you need to generate speech from multiple distinct voices or if you require advanced emotional nuances in the generated audio.

audio-content-creation voiceover-production e-learning digital-publishing narration

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 17 / 25

How are scores calculated?

Stars

341

Forks

Language

Python

License

MIT

Higher-rated alternatives

kan-bayashi/ParallelWaveGAN

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch

fatchord/WaveRNN

WaveRNN Vocoder + TTS

shangeth/wavencoder

WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation,...

rishikksh20/iSTFTNet-pytorch

iSTFTNet : Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier...

seungwonpark/melgan

MelGAN vocoder (compatible with NVIDIA/tacotron2)

Explore Voice AI Tools

All categories Trending Voice AI directory Insights