OpenBMB/VoxCPM

VoxCPM: Tokenizer-Free TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning

75
/ 100
Verified

This tool helps content creators, marketers, educators, and anyone needing realistic voiceovers or personalized audio by converting written text into natural-sounding speech. You provide text and, optionally, a short audio sample of a voice you want to clone, and it generates an audio file with expressive, context-aware speech in that voice. It's for professionals who need high-quality, lifelike synthetic voices for various applications.

6,143 stars. Actively maintained with 36 commits in the last 30 days. Available on PyPI.

Use this if you need to generate highly expressive, human-like speech from text or to accurately clone a voice, including its emotion, accent, and pacing, from a short audio clip.

Not ideal if you require a simpler, less nuanced text-to-speech solution or if your primary need is basic speech generation without advanced expressiveness or precise voice cloning.

voice-cloning audio-content-creation speech-synthesis digital-narration marketing-audio
Maintenance 20 / 25
Adoption 10 / 25
Maturity 24 / 25
Community 21 / 25

How are scores calculated?

Stars

6,143

Forks

744

Language

Python

License

Apache-2.0

Last pushed

Mar 13, 2026

Commits (30d)

36

Dependencies

20

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/OpenBMB/VoxCPM"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.