e-c-k-e-r/vall-e
An unofficial PyTorch implementation of VALL-E
This project helps you create realistic-sounding speech from text using very little original audio. You provide written text and a short audio sample of the voice you want to clone, and it generates new speech in that same voice. This tool is useful for content creators, voice artists, or anyone needing to generate customized spoken audio.
No commits in the last 6 months.
Use this if you need to quickly generate new speech in a specific voice without extensive recording, perfect for tasks like audiobook narration, podcast snippets, or character dialogue.
Not ideal if you need to precisely edit existing audio recordings or require extremely nuanced emotional expression beyond what text-to-speech can provide.
Stars
88
Forks
7
Language
Python
License
AGPL-3.0
Category
Last pushed
Aug 03, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/e-c-k-e-r/vall-e"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
bshall/Tacotron
A PyTorch implementation of Location-Relative Attention Mechanisms For Robust Long-Form Speech Synthesis
Kyubyong/dc_tts
A TensorFlow Implementation of DC-TTS: yet another text-to-speech model
DemisEom/SpecAugment
A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain
Rayhane-mamah/Tacotron-2
DeepMind's Tacotron-2 Tensorflow implementation
Kyubyong/tacotron
A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model