PRITHIVSAKTHIUR/Qwen3-TTS-Daggr-UI

Demonstration for the Qwen/Qwen3-TTS-12Hz models using Daggr for modular UI nodes. Supports voice design (prompt-to-speech), voice cloning (zero-shot), and custom voice synthesis with multiple speakers and languages. Features lazy model loading to optimize memory, multi-model sizes (0.6B and 1.7B), ASR and support for various audio inputs.

25
/ 100
Experimental

This tool helps content creators, educators, or media producers generate lifelike speech from text or clone voices. You can input text, choose from various pre-defined voices and languages, or even provide a short audio sample to clone a voice. The output is high-quality synthesized audio or a transcription of spoken words, perfect for creating voiceovers, audiobooks, or interactive content.

Use this if you need to quickly generate spoken audio in multiple languages, create custom voices for characters, or transcribe audio files with high accuracy.

Not ideal if you require extremely nuanced, emotion-rich vocal performances that can only be achieved by professional human voice actors.

content-creation voice-acting e-learning audio-production multilingual-communication
No Package No Dependents
Maintenance 10 / 25
Adoption 4 / 25
Maturity 11 / 25
Community 0 / 25

How are scores calculated?

Stars

7

Forks

Language

Python

License

Apache-2.0

Last pushed

Feb 12, 2026

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/PRITHIVSAKTHIUR/Qwen3-TTS-Daggr-UI"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.