ga642381/FastSpeech2

Multi-Speaker Pytorch FastSpeech2: Fast and High-Quality End-to-End Text to Speech :fist:

36
/ 100
Emerging

This project helps create high-quality, natural-sounding speech from written text, often for applications like audiobooks or virtual assistants. You provide text and, optionally, existing audio recordings from a speaker, and it generates spoken audio in that voice. This is ideal for content creators, audiobook producers, or anyone needing to generate speech in a consistent voice from various text inputs.

No commits in the last 6 months.

Use this if you need to convert large amounts of text into speech, especially if you want to use multiple different voices or replicate a specific speaker's voice.

Not ideal if you're looking for a simple, ready-to-use text-to-speech API without needing to train a custom model or manage the underlying infrastructure.

audiobook-production voice-synthesis content-creation digital-publishing
No License Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 9 / 25
Maturity 8 / 25
Community 19 / 25

How are scores calculated?

Stars

99

Forks

19

Language

Python

License

Last pushed

Oct 14, 2022

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/ga642381/FastSpeech2"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.