NATSpeech/NATSpeech

A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including official PyTorch implementation of PortaSpeech (NeurIPS 2021) and DiffSpeech (AAAI 2022)

/ 100

Emerging

This framework helps you convert written text into high-quality, natural-sounding speech, or even create singing voices from lyrics. You input text, and it outputs audio files of spoken or sung words. This is useful for content creators, game developers, or anyone needing to generate lifelike synthetic voices for various applications.

1,006 stars. No commits in the last 6 months.

Use this if you need to quickly and efficiently generate realistic speech or singing from text for your projects, especially if you want control over the quality and specific voice characteristics.

Not ideal if you're looking for a simple, plug-and-play web service for basic text-to-speech without any technical setup or customization.

audio-content-creation voice-synthesis game-development multimedia-production virtual-assistants

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 19 / 25

How are scores calculated?

Stars

1,006

Forks

102

Language

Python

License

MIT

Higher-rated alternatives

TensorSpeech/TensorFlowTTS

:stuck_out_tongue_closed_eyes: TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for...

lucasnewman/nanospeech

A simple, hackable text-to-speech system in PyTorch and MLX

Tomiinek/Multilingual_Text_to_Speech

An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing,...

keonlee9420/STYLER

Official repository of STYLER: Style Factor Modeling with Rapidity and Robustness via Speech...

jxzhanggg/nonparaSeq2seqVC_code

Implementation code of non-parallel sequence-to-sequence VC

Explore Voice AI Tools

All categories Trending Voice AI directory Insights