vorojar/VibeVoice

Open-source AI audiobook studio. A free, private alternative to ElevenLabs. 3 voice modes, per-sentence voice & emotion control, LLM smart character analysis, mixed-voice generation. Runs 100% locally on your GPU with zero API costs.

/ 100

Experimental

This is an AI audiobook studio that helps authors, publishers, or content creators convert written text into natural-sounding audiobooks. You input a script or manuscript, and it generates audio narration with customizable voices and emotions for each sentence. The primary users are individuals or small teams looking to produce audio content like audiobooks, podcasts, or narrated articles without cloud service costs.

Use this if you need fine-grained control over voice and emotion for different characters and narration in an audiobook, prefer a free and private solution, and have a local GPU.

Not ideal if you require real-time streaming audio generation, need extensive language support beyond the 10 offered, or lack a powerful local GPU for processing.

audiobook-production voice-narration content-creation podcast-production digital-publishing

No License No Package No Dependents

Maintenance 10 / 25

Adoption 5 / 25

Maturity 3 / 25

Community 6 / 25

How are scores calculated?

Stars

Forks

Language

JavaScript

License

—

Higher-rated alternatives

BoltzmannEntropy/MimikaStudio

MimikaStudio - A local-first application for macOS (Apple Silicon) + Agentic MCP Support

aahl/qwen-asr2api

🎤 Qwen 3 ASR to OpenAI API, 免费STT语音识别模型

gabriele-mastrapasqua/qwen3-tts

Pure C inference engine for Qwen3-TTS text-to-speech. No Python, no PyTorch — just C and BLAS....

zhao-kun/VibeVoiceFusion

VibeVoiceFusion is a full-stack, multi-speaker voice generation web system featuring LoRA...

shijincai/VibeVoice

Archive of the official Microsoft VibeVoice repository (7B & 1.5B). Backup of the deleted source...

Explore Voice AI Tools

All categories Trending Voice AI directory Insights