double22a/speech_dataset

The dataset of Speech Recognition

/ 100

Established

This is a curated list of publicly available speech datasets that are essential for building and improving speech recognition and speech synthesis systems. It provides direct links to datasets of recorded speech, often with accompanying text transcripts, in various languages like Chinese, English, Japanese, and more. Anyone working on speech technology, such as AI researchers or machine learning engineers developing voice assistants or transcription services, would find this resource invaluable for finding the right data.

453 stars.

Use this if you need to find diverse, high-quality speech data in multiple languages to train or evaluate your speech recognition, speech synthesis, or speaker diarization models.

Not ideal if you are looking for a pre-built model or an API for speech processing, rather than raw datasets for development.

speech-recognition speech-synthesis speaker-diarization voice-technology-development natural-language-processing

No Package No Dependents

Maintenance 6 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 22 / 25

How are scores calculated?

Stars

453

Forks

Language

—

License

Apache-2.0

Related tools

Jakobovski/free-spoken-digit-dataset

A free audio dataset of spoken digits. An audio version of MNIST.

Ijwi-ry-Ikirundi-AI/Kirundi_Dataset

🇧🇮 The first large-scale, open-source speech and text dataset for Kirundi language. Building AI...

lottev1991/Project-AIdol-Public-English-Dataset

Public female English corpus used for Project AI❤dol

Jahangirbd23/WenetSpeech-Yue

📑 Explore WenetSpeech-Yue, a comprehensive Cantonese speech corpus with rich annotations,...

Nexdata-AI/338-Hours-Spanish-Speech-Data-by-Mobile-Phone

Spanish Speech Dataset

Explore Voice AI Tools

All categories Trending Voice AI directory Insights