AlexandaJerry/whisper-vits-japanese

Vits Japanese with Whisper as data processor (you can train your VITS even you only have audios)

/ 100

Emerging

This project helps Japanese content creators, voice actors, or anyone needing custom voiceovers to easily create their own text-to-speech models. You provide long audio recordings, and it automatically processes them into short, transcribed segments ready for training a unique AI voice. This is ideal for individuals or small teams looking to generate high-quality, customized Japanese speech from text without needing extensive technical knowledge.

162 stars. No commits in the last 6 months.

Use this if you have long Japanese audio files and want to train a custom text-to-speech model without manually segmenting or transcribing the audio.

Not ideal if you need a pre-trained, off-the-shelf Japanese text-to-speech solution or if you don't want to create your own custom voice model.

voice-synthesis audio-production speech-recognition content-creation Japanese-language

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 19 / 25

How are scores calculated?

Stars

162

Forks

Language

Jupyter Notebook

License

MIT

Higher-rated alternatives

RVC-Boss/GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

High-Logic/Genie-TTS

GPT-SoVITS ONNX Inference Engine & Model Converter

chinokikiss/GSV-TTS-Lite

GSV-TTS-Lite A high-performance inference engine specifically designed for the GPT-SoVITS...

FENRlR/MB-iSTFT-VITS2

Application of MB-iSTFT-VITS components to vits2_pytorch

AlexandaJerry/vits-mandarin-biaobei

application of vits on mandarin tts

Explore Voice AI Tools

All categories Trending Voice AI directory Insights