kanttouchthis/text_generation_webui_xtts
XTTSv2 Extension for oobabooga text-generation-webui
This helps content creators and educators generate natural-sounding speech in multiple languages from written text. You provide text and a short audio clip of a voice, and it outputs an audio file of that text spoken in the cloned voice and chosen language. This is ideal for podcasters, e-learning developers, or animators needing consistent voiceovers.
156 stars. No commits in the last 6 months.
Use this if you need to quickly create multilingual audio content with a consistent, cloned voice.
Not ideal if you prefer using the officially supported XTTSv2 extension already built into text-generation-webui.
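The text-plus-reference-clip workflow described above can be sketched with the Coqui TTS Python package, which distributes the XTTSv2 model. This is a minimal illustration, not this extension's actual code: it assumes the `TTS` package is installed, and the helper name `clone_voice_to_file` and the file names are hypothetical.

```python
from pathlib import Path

# Model identifier used by the Coqui TTS package for XTTSv2.
XTTS_MODEL = "tts_models/multilingual/multi-dataset/xtts_v2"


def clone_voice_to_file(text, speaker_wav, language="en", out_path="output.wav"):
    """Speak `text` in the voice of `speaker_wav` and write a WAV file.

    Hypothetical helper for illustration; it is not part of the extension.
    """
    if not text.strip():
        raise ValueError("text must not be empty")
    if not Path(speaker_wav).is_file():
        raise FileNotFoundError(f"reference clip not found: {speaker_wav}")

    # Imported lazily so the input checks above run even without the package.
    from TTS.api import TTS

    tts = TTS(XTTS_MODEL)  # downloads the model on first use
    tts.tts_to_file(
        text=text,
        speaker_wav=speaker_wav,  # short clip of the voice to clone
        language=language,        # language code, e.g. "en" or "fr"
        file_path=out_path,
    )
    return out_path
```

Calling `clone_voice_to_file("Bonjour tout le monde", "my_voice.wav", language="fr")` would write the French sentence, spoken in the voice from the (hypothetical) `my_voice.wav`, to `output.wav`.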
Stars: 156
Forks: 18
Language: Python
License: not listed
Category:
Last pushed: Nov 21, 2023
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/kanttouchthis/text_generation_webui_xtts"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
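The same request can be made from Python using only the standard library. This is a rough sketch: the function names are hypothetical, and it assumes the endpoint returns JSON, which the page does not state. How an API key is passed is also not documented here, so the sketch uses only the keyless tier.

```python
import json
from urllib.parse import quote
from urllib.request import urlopen

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category, owner, repo):
    """Build the endpoint URL for one repository (hypothetical helper)."""
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category, owner, repo, timeout=10.0):
    """Fetch the repository data; assumes the response body is JSON."""
    url = build_quality_url(category, owner, repo)
    with urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


# Example call (performs a network request and counts against the daily quota):
# data = fetch_quality("voice-ai", "kanttouchthis", "text_generation_webui_xtts")
# print(json.dumps(data, indent=2))
```

Keeping URL construction separate from the request makes it easy to check the target address without spending any of the 100 daily keyless requests.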
Higher-rated alternatives
travisvn/chatterbox-tts-api
Local, OpenAI-compatible text-to-speech (TTS) API using Chatterbox, enabling users to generate...
FunAudioLLM/CosyVoice
Multi-lingual large voice generation model, providing inference, training and deployment...
fishaudio/Bert-VITS2
vits2 backbone with multilingual-bert
sfortis/openai_tts
Custom TTS component for Home Assistant. Utilizes the OpenAI speech engine or any compatible...
OpenMOSS/MOSS-TTSD
MOSS-TTSD is a spoken dialogue generation model designed for expressive multi-speaker synthesis....