keonlee9420/PortaSpeech
PyTorch Implementation of PortaSpeech: Portable and High-Quality Generative Text-to-Speech
This tool turns written text into natural-sounding speech in a single speaker's voice. You provide plain text, and it generates an audio file of that text being spoken. It suits content creators, educators, and anyone who needs voiceovers or other audio content from scripts.
341 stars. No commits in the last 6 months.
Use this if you need to quickly generate high-quality, single-speaker audio from text, especially when you want control over the speaking rate.
Not ideal if you need speech in multiple distinct voices or fine control over emotional expression in the generated audio.
Stars: 341
Forks: 38
Language: Python
License: MIT
Category:
Last pushed: Feb 17, 2022
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/keonlee9420/PortaSpeech"
Open to everyone: 100 requests/day with no key needed. Get a free key for 1,000 requests/day.
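The same request can be made from Python. The following is a minimal sketch using only the standard library. It assumes the endpoint returns JSON and that the URL path is laid out as category, owner, repository, which is inferred from the single curl example above rather than from any documented schema. The helper names `build_quality_url` and `fetch_quality` are hypothetical.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path copied from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category, owner, repo):
    """Build the request URL.

    Assumption: the path is <category>/<owner>/<repo>, inferred from
    the one example URL on this page.
    """
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return "/".join([BASE_URL, *parts])


def fetch_quality(category, owner, repo, timeout=10.0):
    """Fetch the listing data without an API key.

    Assumption: the response body is JSON. Its fields are not
    documented here, so the parsed object is returned unchanged.
    """
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


if __name__ == "__main__":
    # Prints the URL only; call fetch_quality(...) to perform the request.
    print(build_quality_url("voice-ai", "keonlee9420", "PortaSpeech"))
```

Because keyless access is limited to 100 requests/day, it may be worth caching the result of `fetch_quality` instead of calling it repeatedly.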
Higher-rated alternatives
kan-bayashi/ParallelWaveGAN
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch
fatchord/WaveRNN
WaveRNN Vocoder + TTS
shangeth/wavencoder
WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation,...
rishikksh20/iSTFTNet-pytorch
iSTFTNet : Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier...
seungwonpark/melgan
MelGAN vocoder (compatible with NVIDIA/tacotron2)