aman179102/podvoice
Local-first CLI that turns Markdown scripts into multi-speaker podcast-style audio using Coqui XTTS v2.
This tool helps content creators, podcasters, and educators turn written Markdown scripts into professional-sounding, multi-speaker audio. You provide a script where different speakers and their emotions are marked in Markdown, and it generates a single WAV or MP3 audio file with distinct voices for each speaker. It's ideal for anyone who wants to produce podcasts, audio lessons, or voiceovers without relying on cloud services or recurring subscriptions.
Use this if you need to create multi-speaker audio content from text scripts and prefer a fully offline, privacy-focused solution that runs locally on your computer.
Not ideal if you need highly customized, unique voices for every single production or require real-time voice synthesis for live applications.
Stars
25
Forks
10
Language
Python
License
MIT
Category
Last pushed
Feb 22, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/aman179102/podvoice"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
snakers4/silero-models
Silero Models: pre-trained text-to-speech models made embarrassingly simple
abus-aikorea/voice-pro
Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot...
JSchmie/ScrAIbe-WebUI
WebUI for ScAIbe
isaiahbjork/orpheus-tts-local
Run Orpheus 3B Locally With LM Studio
snakers4/silero-stress
Silero Stress — pre-trained enterprise-grade automated stress and homograph disambiguation for...