double22a/speech_dataset
The dataset of Speech Recognition
This is a curated list of publicly available speech datasets for building and improving speech recognition and speech synthesis systems. It links directly to datasets of recorded speech, often with accompanying text transcripts, in languages including Chinese, English, and Japanese. It is aimed at people working on speech technology, such as AI researchers or machine learning engineers developing voice assistants or transcription services, who need to locate suitable training or evaluation data.
Use this if you need to find diverse, high-quality speech data in multiple languages to train or evaluate your speech recognition, speech synthesis, or speaker diarization models.
Not ideal if you are looking for a pre-built model or an API for speech processing, rather than raw datasets for development.
Stars
453
Forks
81
Language
—
License
Apache-2.0
Category
Last pushed
Jan 04, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/double22a/speech_dataset"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
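The same request can be made from a script. The following is a minimal Python sketch using only the standard library, not an official client. The helper names `build_quality_url` and `fetch_quality` are hypothetical, the `category/owner/repo` path pattern is inferred from the single curl URL above, and the response is assumed to be JSON (the raw text is returned if it is not).

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL (hypothetical helper).

    Assumes the path pattern <category>/<owner>/<repo>, inferred from
    the one example URL shown on this page.
    """
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return "/".join([BASE_URL, *parts])


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the listing data for one repository (hypothetical helper).

    Assumption: the endpoint returns JSON. If the body does not parse
    as JSON, the decoded text is returned unchanged.
    """
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        body = response.read().decode("utf-8")
    try:
        return json.loads(body)
    except json.JSONDecodeError:
        return body


if __name__ == "__main__":
    # Prints the URL only; call fetch_quality(...) to make the request.
    print(build_quality_url("voice-ai", "double22a", "speech_dataset"))
```

Calling `fetch_quality("voice-ai", "double22a", "speech_dataset")` issues the same GET request as the curl command. Anonymous use counts against the 100 requests/day limit, so caching the result locally is worth considering for repeated runs.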
Related tools
Jakobovski/free-spoken-digit-dataset
A free audio dataset of spoken digits. An audio version of MNIST.
Ijwi-ry-Ikirundi-AI/Kirundi_Dataset
🇧🇮 The first large-scale, open-source speech and text dataset for Kirundi language. Building AI...
lottev1991/Project-AIdol-Public-English-Dataset
Public female English corpus used for Project AI❤dol
Jahangirbd23/WenetSpeech-Yue
📑 Explore WenetSpeech-Yue, a comprehensive Cantonese speech corpus with rich annotations,...
Nexdata-AI/338-Hours-Spanish-Speech-Data-by-Mobile-Phone
Spanish Speech Dataset