kaiidams/Kokoro-Speech-Dataset
A public domain single speaker Japanese speech dataset
This dataset offers a large collection of Japanese speech audio clips along with their corresponding text. It takes audio recordings of 14 Japanese novels and provides precise text transcriptions in both original Japanese (Kanji-kana mixture) and romanized forms. It's designed for researchers and developers working on speech synthesis or voice recognition systems for the Japanese language.
No commits in the last 6 months.
Use this if you need a free, high-quality, single-speaker Japanese speech dataset to train or test your speech-to-text or text-to-speech models.
Not ideal if you require multi-speaker audio, specific emotional tones, or speech from domains other than Japanese literature.
Stars
65
Forks
7
Language
Python
License
—
Category
Last pushed
Nov 05, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/kaiidams/Kokoro-Speech-Dataset"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
thewh1teagle/kokoro-onnx
TTS with kokoro and onnx runtime
nazdridoy/kokoro-tts
A CLI text-to-speech tool using the Kokoro model, supporting multiple languages, voices (with...
remsky/Kokoro-FastAPI
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model w/CPU ONNX and NVIDIA GPU PyTorch...
Lyrcaxis/KokoroSharp
Fast local TTS inference engine in C# with ONNX runtime. Multi-speaker, multi-platform and...
met4citizen/HeadTTS
HeadTTS: Free neural text-to-speech (Kokoro) with timestamps and visemes for lip-sync. Runs...