AlexandaJerry/whisper-vits-japanese
VITS Japanese with Whisper as the data processor (you can train your VITS model even if you only have audio)
This project helps Japanese content creators, voice actors, or anyone needing custom voiceovers to easily create their own text-to-speech models. You provide long audio recordings, and it automatically processes them into short, transcribed segments ready for training a unique AI voice. This is ideal for individuals or small teams looking to generate high-quality, customized Japanese speech from text without needing extensive technical knowledge.
162 stars. No commits in the last 6 months.
Use this if you have long Japanese audio files and want to train a custom text-to-speech model without manually segmenting or transcribing the audio.
Not ideal if you need a pre-trained, off-the-shelf Japanese text-to-speech solution or if you don't want to create your own custom voice model.
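The segment-and-transcribe step described above can be sketched roughly as follows. This is not the repository's own code. It is a minimal illustration that assumes the `openai-whisper` package, whose `transcribe` result carries a `segments` list with `start`, `end`, and `text` fields, and the `wav_path|text` filelist layout used by the original VITS training scripts. The helper names `segments_to_filelist` and `transcribe_japanese`, and the 1 to 10 second clip window, are hypothetical choices made for this example.

```python
from pathlib import Path


def segments_to_filelist(segments, clip_dir, stem, min_sec=1.0, max_sec=10.0):
    """Turn Whisper-style segments into (start, end, 'wav_path|text') entries.

    Each segment is a dict with 'start' and 'end' (seconds) and 'text',
    the shape found under result["segments"] in openai-whisper output.
    The length window (min_sec, max_sec) is an assumption, not a project setting.
    """
    entries = []
    for seg in segments:
        text = seg["text"].strip()
        duration = seg["end"] - seg["start"]
        # Skip empty transcripts and clips outside the assumed length window.
        if not text or not (min_sec <= duration <= max_sec):
            continue
        # Number clips by their position among the kept entries.
        wav_path = Path(clip_dir) / f"{stem}_{len(entries):04d}.wav"
        entries.append((seg["start"], seg["end"], f"{wav_path.as_posix()}|{text}"))
    return entries


def transcribe_japanese(audio_path, model_name="base"):
    """Run Whisper on one long recording and return its segments.

    Requires `pip install openai-whisper`; the model size is an arbitrary default.
    """
    import whisper  # imported lazily so the helper above works without it

    model = whisper.load_model(model_name)
    result = model.transcribe(str(audio_path), language="ja")
    return result["segments"]
```

Passing the output of `transcribe_japanese("long_recording.wav")` to `segments_to_filelist` would give the timestamps needed to cut each clip, plus one filelist line per clip. Cutting the audio itself and any Japanese text cleaning are left out of this sketch.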
Stars: 162
Forks: 28
Language: Jupyter Notebook
License: MIT
Category:
Last pushed: May 07, 2023
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/AlexandaJerry/whisper-vits-japanese"
Open to everyone: 100 requests/day, no key needed. A free key raises the limit to 1,000 requests/day.
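The same request can be made from Python using only the standard library. The sketch below makes two assumptions the page does not confirm: that the URL follows a `category/owner/repo` pattern (inferred from the single curl example) and that the response body is JSON. The function names `quality_url` and `fetch_quality` are hypothetical.

```python
import json
import urllib.request

# Base path copied from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category, owner, repo):
    """Build the endpoint URL in the same shape as the curl example."""
    return f"{BASE_URL}/{category}/{owner}/{repo}"


def fetch_quality(category, owner, repo, timeout=10):
    """GET the endpoint and parse the body as JSON (assumed response format)."""
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

Calling `fetch_quality("voice-ai", "AlexandaJerry", "whisper-vits-japanese")` would issue the request shown in the curl command. Each call counts against the daily request limit.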
Higher-rated alternatives
RVC-Boss/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
High-Logic/Genie-TTS
GPT-SoVITS ONNX Inference Engine & Model Converter
chinokikiss/GSV-TTS-Lite
GSV-TTS-Lite A high-performance inference engine specifically designed for the GPT-SoVITS...
FENRlR/MB-iSTFT-VITS2
Application of MB-iSTFT-VITS components to vits2_pytorch
AlexandaJerry/vits-mandarin-biaobei
application of vits on mandarin tts