tcsenpai/audiocoqui
A multilingual tool to convert PDF ebooks to audiobooks using XTTS v2 TTS model by cloning a speaker voice.
This tool helps you transform your PDF ebooks into personalized audiobooks. You provide a PDF document and a short audio sample of a voice you like, and it generates a complete audiobook spoken in that cloned voice. This is ideal for anyone who prefers listening to books or wants to make their digital library more accessible.
No commits in the last 6 months.
Use this if you want to convert your digital books into audio format with a consistent, custom voice.
Not ideal if you need professional voice acting, precise control over intonation, or already have audio versions of your PDFs.
Stars
18
Forks
4
Language
Python
License
—
Category
Last pushed
Jan 22, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/tcsenpai/audiocoqui"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
alphacep/vosk-api
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
huggingface/speech-to-speech
Build local voice agents with open-source models
linto-ai/WebVoiceSDK
Buildings block for voice-enabled applications in the browser
Picovoice/speech-to-text-benchmark
speech to text benchmark framework
vox-serve/vox-serve
A Streaming-Native Serving Engine for TTS/STS Models