kaiidams/Kokoro-Speech-Dataset

A public domain single speaker Japanese speech dataset

/ 100

Experimental

This dataset offers a large collection of Japanese speech audio clips along with their corresponding text. It takes audio recordings of 14 Japanese novels and provides precise text transcriptions in both original Japanese (Kanji-kana mixture) and romanized forms. It's designed for researchers and developers working on speech synthesis or voice recognition systems for the Japanese language.

No commits in the last 6 months.

Use this if you need a free, high-quality, single-speaker Japanese speech dataset to train or test your speech-to-text or text-to-speech models.

Not ideal if you require multi-speaker audio, specific emotional tones, or speech from domains other than Japanese literature.

Japanese language speech recognition text-to-speech natural language processing audio processing

No License Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 8 / 25

Maturity 8 / 25

Community 11 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

—

Featured in

Choosing a Voice AI Library in 2026: What's Actually Worth Building On

Higher-rated alternatives

thewh1teagle/kokoro-onnx

TTS with kokoro and onnx runtime

nazdridoy/kokoro-tts

A CLI text-to-speech tool using the Kokoro model, supporting multiple languages, voices (with...

remsky/Kokoro-FastAPI

Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model w/CPU ONNX and NVIDIA GPU PyTorch...

Lyrcaxis/KokoroSharp

Fast local TTS inference engine in C# with ONNX runtime. Multi-speaker, multi-platform and...

met4citizen/HeadTTS

HeadTTS: Free neural text-to-speech (Kokoro) with timestamps and visemes for lip-sync. Runs...

Explore Voice AI Tools

All categories Trending Voice AI directory Insights