CMsmartvoice/Unet-TTS

One-shot TTS with Improved Unseen Speaker and Style Transfer

30 / 100 (Emerging)

This tool helps content creators and developers generate natural-sounding speech in a specific voice and style, including voices the model has not seen before. You provide a short audio sample of the target voice and the text you want spoken, and it outputs synthesized speech that mimics the speaker's characteristics and emotional tone. It suits personalized audio content such as voiceovers or virtual assistants.

No commits in the last 6 months.

Use this if you need to quickly clone a voice and speaking style from a very short audio clip to synthesize new, arbitrary text.

Not ideal if you require extremely high-fidelity, indistinguishable voice cloning for sensitive applications where even minor artificiality is unacceptable.

voice-cloning audio-content-creation speech-synthesis virtual-assistants e-learning-narration
No License, Stale 6m, No Package, No Dependents
Maintenance 0 / 25
Adoption 7 / 25
Maturity 8 / 25
Community 15 / 25
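
The four category scores above add up to the overall 30 / 100 shown at the top of the page. The short sketch below only checks that arithmetic. It assumes the overall score is a plain sum of the four categories, which matches these numbers but is not confirmed by the page; the variable names are illustrative.

```python
# Category scores as listed on this page (each out of 25).
category_scores = {
    "Maintenance": 0,
    "Adoption": 7,
    "Maturity": 8,
    "Community": 15,
}
MAX_PER_CATEGORY = 25

# Assumption: the overall score is the unweighted sum of the categories.
total = sum(category_scores.values())
max_total = MAX_PER_CATEGORY * len(category_scores)

print(f"{total} / {max_total}")  # prints "30 / 100"
```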


Stars: 37
Forks: 7
Language: (not listed)
License: none
Last pushed: Mar 02, 2022
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/CMsmartvoice/Unet-TTS"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
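
The curl request above can also be made from Python using only the standard library. This is a minimal sketch, not an official client: the helper names `build_quality_url` and `fetch_quality` are hypothetical, and the code assumes the endpoint returns a JSON body, which the page does not state. The URL layout (category segment, then owner/repo) is inferred from the single example URL.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, repo: str) -> str:
    """Build the endpoint URL for a repo such as 'CMsmartvoice/Unet-TTS'.

    Assumption: other repos follow the same /<category>/<owner>/<name> layout.
    """
    # Keep the slash between owner and repo name unescaped.
    return f"{BASE_URL}/{quote(category)}/{quote(repo, safe='/')}"


def fetch_quality(category: str, repo: str, timeout: float = 10.0):
    """Request the data and decode it. Assumes the response body is JSON."""
    url = build_quality_url(category, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


# Building the URL performs no network access.
print(build_quality_url("voice-ai", "CMsmartvoice/Unet-TTS"))
```

Calling `fetch_quality("voice-ai", "CMsmartvoice/Unet-TTS")` would perform the actual request. Since keyless access is limited to 100 requests/day, it may be worth caching the result rather than fetching on every run.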