hcy71o/SNAC

Unofficial Pytorch implementation of SNAC: Speaker-normalized affine coupling layer in flow-based architecture for zero-shot multi-speaker text-to-speech

/ 100

Emerging

This project helps create realistic, human-like speech from written text, even for voices it hasn't heard before. You provide text and a small audio sample of a target voice, and it generates that text spoken in the new voice. This is ideal for content creators, educators, or anyone needing to produce custom voiceovers efficiently without hiring professional voice actors.

No commits in the last 6 months.

Use this if you need to generate high-quality, custom voice narration for text in a voice that sounds natural and consistent, even if you only have a short sample of that voice.

Not ideal if you're looking for a simple, off-the-shelf text-to-speech solution without custom voice generation capabilities, or if you need to generate speech in many different languages without specific voice cloning.

voice-synthesis audio-production content-creation narration media-localization

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 8 / 25

Maturity 16 / 25

Community 16 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

MIT

Higher-rated alternatives

bshall/Tacotron

A PyTorch implementation of Location-Relative Attention Mechanisms For Robust Long-Form Speech Synthesis

Kyubyong/dc_tts

A TensorFlow Implementation of DC-TTS: yet another text-to-speech model

DemisEom/SpecAugment

A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain

Rayhane-mamah/Tacotron-2

DeepMind's Tacotron-2 Tensorflow implementation

Kyubyong/tacotron

A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model

Explore Voice AI Tools

All categories Trending Voice AI directory Insights