PRITHIVSAKTHIUR/Qwen3-TTS-Daggr-UI

A demonstration of the Qwen/Qwen3-TTS-12Hz models that uses Daggr to build the UI from modular nodes. It supports voice design (prompt-to-speech), zero-shot voice cloning, and custom voice synthesis with multiple speakers and languages. It also offers lazy model loading to reduce memory use, two model sizes (0.6B and 1.7B), ASR, and support for various audio inputs.
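Lazy model loading, as mentioned above, usually means a model is loaded only when it is first requested and then reused. The following is a minimal sketch of that general pattern, not the repository's actual code: `LazyModelCache`, `fake_loader`, and the task names are hypothetical, and the loader is a stand-in for whatever expensive checkpoint load the demo performs.

```python
from typing import Callable, Dict, Tuple

# Hypothetical cache key: (task, size), e.g. ("voice_clone", "0.6B").
ModelKey = Tuple[str, str]


class LazyModelCache:
    """Load each model on first request, then reuse the loaded instance."""

    def __init__(self, loader: Callable[[str, str], object]) -> None:
        self._loader = loader  # called only on a cache miss
        self._models: Dict[ModelKey, object] = {}
        self.load_count = 0  # number of real loads performed so far

    def get(self, task: str, size: str) -> object:
        key = (task, size)
        if key not in self._models:
            # First request for this (task, size): pay the load cost once.
            self._models[key] = self._loader(task, size)
            self.load_count += 1
        return self._models[key]


def fake_loader(task: str, size: str) -> object:
    # Stand-in for an expensive checkpoint load (assumption for illustration).
    return {"task": task, "size": size}


cache = LazyModelCache(fake_loader)
first = cache.get("voice_design", "1.7B")
second = cache.get("voice_design", "1.7B")  # served from the cache, no reload
```

With this approach nothing is held in memory until a node actually needs it, at the cost of a slower first request per model.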


Experimental

This tool helps content creators, educators, and media producers generate speech from text or clone a voice. You enter text and choose from predefined voices and languages, or provide a short audio sample to clone. The output is synthesized audio, or a transcription when you use the ASR feature, suitable for voiceovers, audiobooks, and interactive content.

Use this if you need to quickly generate spoken audio in multiple languages, create custom voices for characters, or transcribe audio files with high accuracy.

Not ideal if you require extremely nuanced, emotion-rich vocal performances that can only be achieved by professional human voice actors.

content-creation voice-acting e-learning audio-production multilingual-communication

No Package

No Dependents

Maintenance 10 / 25

Adoption 4 / 25

Maturity 11 / 25

Community 0 / 25


Stars

Forks

—

Language

Python

License

Apache-2.0

Higher-rated alternatives

BoltzmannEntropy/MimikaStudio

MimikaStudio - A local-first application for macOS (Apple Silicon) + Agentic MCP Support

aahl/qwen-asr2api

🎤 Qwen 3 ASR to OpenAI API, a free STT speech recognition model

gabriele-mastrapasqua/qwen3-tts

Pure C inference engine for Qwen3-TTS text-to-speech. No Python, no PyTorch — just C and BLAS....

zhao-kun/VibeVoiceFusion

VibeVoiceFusion is a full-stack, multi-speaker voice generation web system featuring LoRA...

shijincai/VibeVoice

Archive of the official Microsoft VibeVoice repository (7B & 1.5B). Backup of the deleted source...
