voicebox and vox-box

These are **competitors**: both are open-source, self-hosted speech synthesis tools. voicebox is a visual studio application for voice cloning and speech generation, while vox-box is a server that exposes an OpenAI-compatible API for text-to-speech and speech-to-text across several backends.

| Metric | voicebox | vox-box |
| --- | --- | --- |
| Overall score | 67 (Established) | 50 (Established) |
| Maintenance | 25/25 | 6/25 |
| Adoption | 10/25 | 10/25 |
| Maturity | 11/25 | 16/25 |
| Community | 21/25 | 18/25 |
| Stars | 13,404 | 200 |
| Forks | 1,562 | 32 |
| Downloads | (not listed) | (not listed) |
| Commits (30d) | 174 | 0 |
| Language | TypeScript | Python |
| License | MIT | Apache-2.0 |
| Package | No Package, No Dependents | No Package, No Dependents |

About voicebox

jamiepine/voicebox

The open-source voice synthesis studio

Voicebox is an open-source voice synthesis studio that allows you to clone voices from short audio samples and generate speech in multiple languages with various effects. You can input text and existing voice recordings to create high-quality, expressive spoken audio. This tool is ideal for content creators, podcasters, game developers, or anyone needing realistic, customizable voiceovers.

voiceover-production podcast-creation audiobook-narration content-creation game-audio

About vox-box

gpustack/vox-box

A text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends.

This tool allows developers to quickly set up a server for converting spoken audio into written text or turning written text into natural-sounding speech. You input audio files or written text, and it outputs the corresponding text transcriptions or audio narration. It's designed for developers building applications that need robust speech recognition or text-to-speech capabilities, such as voice assistants or content creation tools.
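As a rough illustration of what "compatible with the OpenAI API" could mean for a client, the sketch below builds a text-to-speech request against the OpenAI-style `/v1/audio/speech` route using only the Python standard library. The base URL, port, model name, and voice name are placeholders assumed for this example, not values taken from vox-box; check the project's own documentation for the routes and models it actually serves.

```python
import json
import urllib.request

# Assumption: the server is reachable locally on this address and port.
BASE_URL = "http://localhost:8080"


def build_speech_request(text, model="hypothetical-tts-model",
                         voice="default", base_url=BASE_URL):
    """Build (but do not send) a POST request in the OpenAI speech format.

    The model and voice defaults are hypothetical placeholders.
    """
    payload = {"model": model, "input": text, "voice": voice}
    return urllib.request.Request(
        url=f"{base_url}/v1/audio/speech",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def synthesize(text, out_path="speech.mp3", **kwargs):
    """Send the request and write the returned audio bytes to out_path."""
    req = build_speech_request(text, **kwargs)
    with urllib.request.urlopen(req) as resp, open(out_path, "wb") as f:
        f.write(resp.read())
    return out_path
```

Calling `synthesize("Hello from a self-hosted server")` would write the response body to `speech.mp3`, provided a compatible server is running at the assumed address. Separating request construction from sending keeps the payload easy to inspect without a live server.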

application-development voice-technology AI-integration speech-recognition audio-narration

Scores updated daily from GitHub, PyPI, and npm data.