Mobile-Artificial-Intelligence/babylon
Babylon.cpp is a C and C++ library for grapheme-to-phoneme conversion and text-to-speech synthesis. Phonemization uses an ONNX Runtime port of the DeepPhonemizer model, and speech synthesis uses VITS models. Piper models are compatible once they have been run through a conversion script.
This project helps integrate text-to-speech capabilities into applications, allowing them to convert written text into spoken audio. You provide written words or sentences, and it produces an audio file of that text being spoken. This tool is for application developers or system integrators building voice-enabled features.
Use this if you need to add offline, high-quality text-to-speech synthesis directly into your C or C++ application, without relying on cloud services.
Not ideal if you need a plug-and-play solution without any programming, or if your application isn't built with C/C++.
Stars: 30
Forks: 3
Language: Python
License: MIT
Category:
Last pushed: Mar 09, 2026
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/Mobile-Artificial-Intelligence/babylon"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
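The same request can be made from a script. The sketch below is a minimal illustration that uses only the Python standard library. It assumes the endpoint path follows a category/owner/repository layout, which is inferred from the single curl example above, and it assumes the response body is JSON, which this page does not state. The helper names `quality_url` and `fetch_quality` are hypothetical.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path copied from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL for one repository.

    Assumption: the path layout is <category>/<owner>/<repo>,
    inferred from the one example URL on this page.
    """
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the record for one repository and decode it.

    Assumption: the server returns a JSON body; the schema is not
    documented here, so the decoded value is returned unchanged.
    """
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_quality("voice-ai", "Mobile-Artificial-Intelligence", "babylon")` issues the same GET request as the curl command. With the keyless limit of 100 requests/day, a caller that polls many repositories would likely want to cache results locally.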
Higher-rated alternatives
pnnbao97/VieNeu-TTS
Vietnamese TTS with instant voice cloning • On-device • Real-time CPU inference • 24kHz audio...
CorentinJ/Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
babysor/MockingBird
🚀Clone a voice in 5 seconds to generate arbitrary speech in real-time
r9y9/nnmnkwii
Library to build speech synthesis systems designed for easy and fast prototyping.
Softcatala/open-dubbing
Open dubbing is an AI dubbing system which uses machine learning models to automatically...