lucasnewman/e2-tts-mlx

Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in MLX

Emerging

This project helps developers and researchers build custom text-to-speech (TTS) systems. Given written text and a reference audio sample, it generates speech in the voice of the reference. It is aimed at machine learning engineers, AI researchers, and audio developers working on speech synthesis applications.

No commits in the last 6 months.

Use this if you need to train a new text-to-speech model that adapts to different voices from short audio samples without explicit text-to-audio alignment.

Not ideal if you're an end-user looking for a ready-to-use text-to-speech application; this is a development tool for building such systems.

speech-synthesis, text-to-speech, zero-shot-learning, machine-learning-development, audio-generation

Stale 6m, No Package, No Dependents

Maintenance 0 / 25

Adoption 6 / 25

Maturity 16 / 25

Community 11 / 25

Language: Python

License: MIT

Higher-rated alternatives

TensorSpeech/TensorFlowTTS

TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for...

lucasnewman/nanospeech

A simple, hackable text-to-speech system in PyTorch and MLX

Tomiinek/Multilingual_Text_to_Speech

An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing,...

keonlee9420/STYLER

Official repository of STYLER: Style Factor Modeling with Rapidity and Robustness via Speech...

jxzhanggg/nonparaSeq2seqVC_code

Implementation code of non-parallel sequence-to-sequence VC
