e-c-k-e-r/vall-e

An unofficial PyTorch implementation of VALL-E

/ 100

Emerging

This project helps you create realistic-sounding speech from text using very little original audio. You provide written text and a short audio sample of the voice you want to clone, and it generates new speech in that same voice. This tool is useful for content creators, voice artists, or anyone needing to generate customized spoken audio.

No commits in the last 6 months.

Use this if you need to quickly generate new speech in a specific voice without extensive recording, perfect for tasks like audiobook narration, podcast snippets, or character dialogue.

Not ideal if you need to precisely edit existing audio recordings or require extremely nuanced emotional expression beyond what text-to-speech can provide.

voice-cloning text-to-speech audio-content-creation digital-voice-synthesis podcast-production

Stale 6m No Package No Dependents

Maintenance 2 / 25

Adoption 9 / 25

Maturity 16 / 25

Community 10 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

AGPL-3.0

Higher-rated alternatives

bshall/Tacotron

A PyTorch implementation of Location-Relative Attention Mechanisms For Robust Long-Form Speech Synthesis

Kyubyong/dc_tts

A TensorFlow Implementation of DC-TTS: yet another text-to-speech model

DemisEom/SpecAugment

A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain

Rayhane-mamah/Tacotron-2

DeepMind's Tacotron-2 Tensorflow implementation

Kyubyong/tacotron

A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model

Explore Voice AI Tools

All categories Trending Voice AI directory Insights