vorojar/VibeVoice
Open-source AI audiobook studio. A free, private alternative to ElevenLabs. 3 voice modes, per-sentence voice & emotion control, LLM smart character analysis, mixed-voice generation. Runs 100% locally on your GPU with zero API costs.
This is an AI audiobook studio that helps authors, publishers, or content creators convert written text into natural-sounding audiobooks. You input a script or manuscript, and it generates audio narration with customizable voices and emotions for each sentence. The primary users are individuals or small teams looking to produce audio content like audiobooks, podcasts, or narrated articles without cloud service costs.
Use this if you need fine-grained control over voice and emotion for different characters and narration in an audiobook, prefer a free and private solution, and have a local GPU.
Not ideal if you require real-time streaming audio generation, need languages beyond the 10 supported, or lack a powerful local GPU.
Stars: 14
Forks: 1
Language: JavaScript
License: not listed
Category:
Last pushed: Feb 20, 2026
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/vorojar/VibeVoice"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
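For scripted use, the same request can be made from Python. The following is a minimal sketch using only the standard library. It assumes the endpoint returns a JSON body, which this page does not confirm, and the helper names `build_url` and `fetch_quality` are hypothetical, introduced here only for illustration. The URL pattern (category, then owner/repo) is inferred from the curl example above.

```python
import json
import urllib.request

# Base path taken from the curl example; the remaining segments are
# assumed to be <category>/<owner>/<repo>.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_url(category: str, repo: str) -> str:
    """Join the base URL, a category, and an owner/name repo slug."""
    return f"{BASE_URL}/{category}/{repo}"


def fetch_quality(category: str, repo: str, timeout: float = 10.0):
    """GET the endpoint and parse the body as JSON (assumed response format)."""
    with urllib.request.urlopen(build_url(category, repo), timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


if __name__ == "__main__":
    # Keyless access is limited to 100 requests/day, so avoid tight polling loops.
    print(fetch_quality("voice-ai", "vorojar/VibeVoice"))
```

Since the response fields are not documented here, inspect the returned object before relying on any particular key.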
Higher-rated alternatives
BoltzmannEntropy/MimikaStudio
MimikaStudio - A local-first application for macOS (Apple Silicon) + Agentic MCP Support
aahl/qwen-asr2api
🎤 Qwen 3 ASR to OpenAI API, a free STT speech recognition model
gabriele-mastrapasqua/qwen3-tts
Pure C inference engine for Qwen3-TTS text-to-speech. No Python, no PyTorch — just C and BLAS....
zhao-kun/VibeVoiceFusion
VibeVoiceFusion is a full-stack, multi-speaker voice generation web system featuring LoRA...
shijincai/VibeVoice
Archive of the official Microsoft VibeVoice repository (7B & 1.5B). Backup of the deleted source...