Mobile-Artificial-Intelligence/babylon

Babylon.cpp is a C and C++ library for grapheme to phoneme conversion and text to speech synthesis. For phonemization a ONNX runtime port of the DeepPhonemizer model is used. For speech synthesis VITS models are used. Piper models are compatible after a conversion script is run.

/ 100

Emerging

This project helps integrate text-to-speech capabilities into applications, allowing them to convert written text into spoken audio. You provide written words or sentences, and it produces an audio file of that text being spoken. This tool is for application developers or system integrators building voice-enabled features.

Use this if you need to add offline, high-quality text-to-speech synthesis directly into your C or C++ application, without relying on cloud services.

Not ideal if you need a plug-and-play solution without any programming, or if your application isn't built with C/C++.

application-development voice-user-interface speech-synthesis offline-processing

No Package No Dependents

Maintenance 10 / 25

Adoption 7 / 25

Maturity 16 / 25

Community 9 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

MIT

Higher-rated alternatives

pnnbao97/VieNeu-TTS

Vietnamese TTS with instant voice cloning • On-device • Real-time CPU inference • 24kHz audio...

CorentinJ/Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

babysor/MockingBird

🚀Clone a voice in 5 seconds to generate arbitrary speech in real-time

r9y9/nnmnkwii

Library to build speech synthesis systems designed for easy and fast prototyping.

Softcatala/open-dubbing

Open dubbing is an AI dubbing system which uses machine learning models to automatically...

Explore Voice AI Tools

All categories Trending Voice AI directory Insights