kaiidams/Kokoro-Speech-Dataset

A public domain single speaker Japanese speech dataset

27
/ 100
Experimental

This dataset offers a large collection of Japanese speech audio clips along with their corresponding text. It takes audio recordings of 14 Japanese novels and provides precise text transcriptions in both original Japanese (Kanji-kana mixture) and romanized forms. It's designed for researchers and developers working on speech synthesis or voice recognition systems for the Japanese language.

No commits in the last 6 months.

Use this if you need a free, high-quality, single-speaker Japanese speech dataset to train or test your speech-to-text or text-to-speech models.

Not ideal if you require multi-speaker audio, specific emotional tones, or speech from domains other than Japanese literature.

Japanese language speech recognition text-to-speech natural language processing audio processing
No License Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 8 / 25
Maturity 8 / 25
Community 11 / 25

How are scores calculated?

Stars

65

Forks

7

Language

Python

License

Last pushed

Nov 05, 2023

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/kaiidams/Kokoro-Speech-Dataset"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.