double22a/speech_dataset

The dataset of Speech Recognition

54
/ 100
Established

This is a curated list of publicly available speech datasets that are essential for building and improving speech recognition and speech synthesis systems. It provides direct links to datasets of recorded speech, often with accompanying text transcripts, in various languages like Chinese, English, Japanese, and more. Anyone working on speech technology, such as AI researchers or machine learning engineers developing voice assistants or transcription services, would find this resource invaluable for finding the right data.

453 stars.

Use this if you need to find diverse, high-quality speech data in multiple languages to train or evaluate your speech recognition, speech synthesis, or speaker diarization models.

Not ideal if you are looking for a pre-built model or an API for speech processing, rather than raw datasets for development.

speech-recognition speech-synthesis speaker-diarization voice-technology-development natural-language-processing
No Package No Dependents
Maintenance 6 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 22 / 25

How are scores calculated?

Stars

453

Forks

81

Language

License

Apache-2.0

Last pushed

Jan 04, 2026

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/double22a/speech_dataset"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.