fishaudio/Bert-VITS2
vits2 backbone with multilingual-bert
This project helps creators, voice artists, or content developers generate natural-sounding speech from text in multiple languages. You input written text, and it outputs realistic spoken audio. It's designed for individuals who need to convert scripts or text into vocal performances for various multimedia applications.
8,707 stars.
Use this if you need to create speech from text for videos, games, audiobooks, or other content, especially when working with different languages.
Not ideal if you require an actively maintained project with ongoing updates, as the developers recommend their newer 'Fish-Speech' project as a replacement.
Stars
8,707
Forks
1,267
Language
Python
License
AGPL-3.0
Category
Last pushed
Mar 09, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/fishaudio/Bert-VITS2"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
travisvn/chatterbox-tts-api
Local, OpenAI-compatible text-to-speech (TTS) API using Chatterbox, enabling users to generate...
FunAudioLLM/CosyVoice
Multi-lingual large voice generation model, providing inference, training and deployment...
sfortis/openai_tts
Custom TTS component for Home Assistant. Utilizes the OpenAI speech engine or any compatible...
OpenMOSS/MOSS-TTSD
MOSS-TTSD is a spoken dialogue generation model designed for expressive multi-speaker synthesis....
OpenMOSS/MOSS-TTS
MOSS‑TTS Family is an open‑source speech and sound generation model family from MOSI.AI and the...