keonlee9420/StyleSpeech

PyTorch Implementation of Meta-StyleSpeech : Multi-Speaker Adaptive Text-to-Speech Generation

42
/ 100
Emerging

This project helps content creators and educators generate natural-sounding speech from text in a specific voice style. You provide written text and a short audio clip of a target speaker's voice, and it produces an audio file of your text spoken in that same voice. Anyone who needs to create custom voiceovers or synthetic speech that matches an existing voice could use this.

197 stars. No commits in the last 6 months.

Use this if you need to generate high-quality, natural-sounding speech from text that mimics the style, pitch, and tone of a specific reference speaker.

Not ideal if you only need generic text-to-speech without specific voice adaptation or if you require fine-grained control over individual style factors beyond basic pitch, volume, and speaking rate.

voice-synthesis audio-content-creation e-learning digital-narration media-production
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 16 / 25

How are scores calculated?

Stars

197

Forks

23

Language

Python

License

MIT

Last pushed

Feb 10, 2022

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/keonlee9420/StyleSpeech"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.