ZET-Speech/ZET-Speech-Demo

ZET-Speech: Zero-shot adaptive Emotion-controllable Text-to-Speech Synthesis with Diffusion and Style-based Models (TTS)

Quality score: 13 / 100 (Experimental)

This tool helps content creators, educators, or anyone needing to generate speech with specific emotions from text. You provide written text and an audio sample demonstrating the desired emotional tone, and it produces a natural-sounding audio recording of your text, spoken with that emotion. It's designed for professionals who need high-quality, emotionally expressive voiceovers without hiring voice actors for every nuance.

No commits in the last 6 months.

Use this if you need to quickly generate spoken audio that conveys a specific emotional style, based on a short example.

Not ideal if you require precise, syllable-level control over speech elements or if you only need a standard, unemotional voice.

content-creation, voiceover-production, e-learning-audio, narration, audio-content
No License | Stale 6m | No Package | No Dependents
Maintenance 0 / 25
Adoption 5 / 25
Maturity 8 / 25
Community 0 / 25


Stars: 10
Forks: (not shown)
Language: JavaScript
License: (not shown)
Last pushed: Mar 09, 2024
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/ZET-Speech/ZET-Speech-Demo"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
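
The same request can be made from a script. The following is a minimal Python sketch, not an official client. It assumes the endpoint returns JSON and that the path follows a category/owner/repo pattern, which is inferred only from the single curl example above. The names quality_url and fetch_quality are hypothetical helpers introduced for illustration.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path copied from the curl example; everything after it is assumed
# to follow the pattern <category>/<owner>/<repo>.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the request URL (hypothetical helper, pattern inferred from one example)."""
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the quality record and decode it, assuming the body is JSON."""
    request = urllib.request.Request(
        quality_url(category, owner, repo),
        headers={"Accept": "application/json"},
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling fetch_quality("voice-ai", "ZET-Speech", "ZET-Speech-Demo") issues the same request as the curl command. Because the response schema is not documented on this page, inspect the returned object before relying on any particular field, and keep the 100 requests/day keyless limit in mind when polling.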