PRITHIVSAKTHIUR/Qwen3-TTS-Daggr-UI
Demonstration for the Qwen/Qwen3-TTS-12Hz models using Daggr for modular UI nodes. Supports voice design (prompt-to-speech), voice cloning (zero-shot), and custom voice synthesis with multiple speakers and languages. Features lazy model loading to optimize memory, multi-model sizes (0.6B and 1.7B), ASR and support for various audio inputs.
This tool helps content creators, educators, or media producers generate lifelike speech from text or clone voices. You can input text, choose from various pre-defined voices and languages, or even provide a short audio sample to clone a voice. The output is high-quality synthesized audio or a transcription of spoken words, perfect for creating voiceovers, audiobooks, or interactive content.
Use this if you need to quickly generate spoken audio in multiple languages, create custom voices for characters, or transcribe audio files with high accuracy.
Not ideal if you require extremely nuanced, emotion-rich vocal performances that can only be achieved by professional human voice actors.
Stars
7
Forks
—
Language
Python
License
Apache-2.0
Category
Last pushed
Feb 12, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/PRITHIVSAKTHIUR/Qwen3-TTS-Daggr-UI"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
BoltzmannEntropy/MimikaStudio
MimikaStudio - A local-first application for macOS (Apple Silicon) + Agentic MCP Support
aahl/qwen-asr2api
🎤 Qwen 3 ASR to OpenAI API, 免费STT语音识别模型
gabriele-mastrapasqua/qwen3-tts
Pure C inference engine for Qwen3-TTS text-to-speech. No Python, no PyTorch — just C and BLAS....
zhao-kun/VibeVoiceFusion
VibeVoiceFusion is a full-stack, multi-speaker voice generation web system featuring LoRA...
shijincai/VibeVoice
Archive of the official Microsoft VibeVoice repository (7B & 1.5B). Backup of the deleted source...