NATSpeech/NATSpeech

A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including official PyTorch implementation of PortaSpeech (NeurIPS 2021) and DiffSpeech (AAAI 2022)

45
/ 100
Emerging

This framework helps you convert written text into high-quality, natural-sounding speech, or even create singing voices from lyrics. You input text, and it outputs audio files of spoken or sung words. This is useful for content creators, game developers, or anyone needing to generate lifelike synthetic voices for various applications.

1,006 stars. No commits in the last 6 months.

Use this if you need to quickly and efficiently generate realistic speech or singing from text for your projects, especially if you want control over the quality and specific voice characteristics.

Not ideal if you're looking for a simple, plug-and-play web service for basic text-to-speech without any technical setup or customization.

audio-content-creation voice-synthesis game-development multimedia-production virtual-assistants
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 19 / 25

How are scores calculated?

Stars

1,006

Forks

102

Language

Python

License

MIT

Last pushed

Apr 02, 2023

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/NATSpeech/NATSpeech"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.