keonlee9420/Deep-Learning-TTS-Template

This is a template for the Non-autoregressive Deep Learning-Based TTS model (in PyTorch).

/ 100

Experimental

This project offers a foundational structure for building a Text-to-Speech (TTS) system that converts written text into natural-sounding spoken audio. It provides the essential components like data handling, model training, and audio generation tools. The primary users are researchers or engineers who are developing custom deep learning-based TTS models.

No commits in the last 6 months.

Use this if you are developing a new non-autoregressive deep learning Text-to-Speech model and need a ready-made pipeline for data preparation, training, and synthesis.

Not ideal if you are a non-developer seeking an out-of-the-box solution to generate speech from text, or if you need to deploy an existing TTS model without customization.

Speech Synthesis Deep Learning Research Machine Learning Engineering Voice AI Development

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 5 / 25

Maturity 16 / 25

Community 0 / 25

How are scores calculated?

Stars

Forks

—

Language

Python

License

MIT

Higher-rated alternatives

TensorSpeech/TensorFlowTTS

:stuck_out_tongue_closed_eyes: TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for...

lucasnewman/nanospeech

A simple, hackable text-to-speech system in PyTorch and MLX

Tomiinek/Multilingual_Text_to_Speech

An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing,...

keonlee9420/STYLER

Official repository of STYLER: Style Factor Modeling with Rapidity and Robustness via Speech...

jxzhanggg/nonparaSeq2seqVC_code

Implementation code of non-parallel sequence-to-sequence VC

Explore Voice AI Tools

All categories Trending Voice AI directory Insights