robmsmt/ASR-Audio-Data-Links
A list of publically available audio data that anyone can download for ASR or other speech activities
This is a curated list of publicly available audio datasets for anyone working with speech technology. It provides direct links to various speech corpora, including spoken text and conversational audio, which can be used to train and evaluate speech recognition or text-to-speech systems. This resource is ideal for researchers, engineers, and data scientists developing speech applications.
231 stars. No commits in the last 6 months.
Use this if you need to find and download audio data to develop or test speech recognition (ASR) or text-to-speech (TTS) models.
Not ideal if you are looking for ready-to-use speech models or a tool to analyze audio, rather than raw audio data.
Stars
231
Forks
22
Language
Shell
License
Apache-2.0
Category
Last pushed
Aug 06, 2021
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/robmsmt/ASR-Audio-Data-Links"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
ynop/audiomate
Python library for handling audio datasets.
reazon-research/ReazonSpeech
Massive open Japanese speech corpus
common-voice/cv-dataset
Metadata and versioning details for the Common Voice dataset
davidmartinrius/speech-dataset-generator
🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset...
EgorLakomkin/KTSpeechCrawler
Automatically constructing corpus for automatic speech recognition from YouTube videos