jamiepine/voicebox
The open-source voice synthesis studio
Voicebox is an open-source voice synthesis studio that allows you to clone voices from short audio samples and generate speech in multiple languages with various effects. You can input text and existing voice recordings to create high-quality, expressive spoken audio. This tool is ideal for content creators, podcasters, game developers, or anyone needing realistic, customizable voiceovers.
13,404 stars. Actively maintained with 174 commits in the last 30 days.
Use this if you need to create custom voiceovers, narrations, or conversational audio with unique voices and professional audio effects, all while keeping your data private and local.
Not ideal if you prefer cloud-based services for voice synthesis or require a solution that integrates directly into web browsers without a local application.
Stars
13,404
Forks
1,562
Language
TypeScript
License
MIT
Category
Last pushed
Mar 18, 2026
Commits (30d)
174
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/jamiepine/voicebox"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Recent Releases
Related tools
devnen/Chatterbox-TTS-Server
Self-host the powerful Chatterbox TTS model. This server offers a user-friendly Web UI, flexible...
daswer123/xtts-api-server
A simple FastAPI Server to run XTTSv2
Aivis-Project/AivisSpeech-Engine
AivisSpeech Engine: AI Voice Imitation System - Text to Speech Engine
jianchang512/ChatTTS-ui
一个简单的本地网页界面,使用ChatTTS将文字合成为语音,同时支持对外提供API接口。A simple native web interface that uses ChatTTS to...
gokhaneraslan/chatterbox-finetuning
Fine-tuning toolkit for Chatterbox TTS & Chatterbox TURBO models. Supports 23 languages with...