fishaudio/Bert-VITS2

vits2 backbone with multilingual-bert

/ 100

Established

This project helps creators, voice artists, or content developers generate natural-sounding speech from text in multiple languages. You input written text, and it outputs realistic spoken audio. It's designed for individuals who need to convert scripts or text into vocal performances for various multimedia applications.

8,707 stars.

Use this if you need to create speech from text for videos, games, audiobooks, or other content, especially when working with different languages.

Not ideal if you require an actively maintained project with ongoing updates, as the developers recommend their newer 'Fish-Speech' project as a replacement.

voice-synthesis content-creation multilingual-audio text-to-speech digital-voice

No Package No Dependents

Maintenance 10 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 22 / 25

How are scores calculated?

Stars

8,707

Forks

1,267

Language

Python

License

AGPL-3.0

Related tools

travisvn/chatterbox-tts-api

Local, OpenAI-compatible text-to-speech (TTS) API using Chatterbox, enabling users to generate...

FunAudioLLM/CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment...

sfortis/openai_tts

Custom TTS component for Home Assistant. Utilizes the OpenAI speech engine or any compatible...

OpenMOSS/MOSS-TTSD

MOSS-TTSD is a spoken dialogue generation model designed for expressive multi-speaker synthesis....

OpenMOSS/MOSS-TTS

MOSS‑TTS Family is an open‑source speech and sound generation model family from MOSI.AI and the...

Explore Voice AI Tools

All categories Trending Voice AI directory Insights