german-asr/megs
A merged version of multiple open-source German speech datasets.
Training accurate German automatic speech recognition (ASR) models requires large amounts of spoken German audio paired with transcripts. This project gathers several open-source German speech datasets and merges them into a single, ready-to-use corpus of organized audio files and matching text transcripts for training and evaluating ASR systems. It is aimed at researchers and developers working on German language technology.
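As a rough illustration of what merging several speech datasets into one corpus can involve, here is a minimal Python sketch. It is not the project's actual code: the `Utterance` record, the `merge_corpora` function, and the corpus names and file paths are all hypothetical, and the only idea shown is prefixing utterance IDs with the source corpus name so that IDs stay unique after the merge.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Utterance:
    """Hypothetical record for one audio clip and its transcript."""
    utt_id: str      # unique only within its source corpus
    audio_path: str  # path to the audio file
    transcript: str  # reference text for the clip


def merge_corpora(corpora: dict[str, list[Utterance]]) -> list[Utterance]:
    """Merge several corpora into one list with globally unique IDs.

    Each ID is prefixed with the name of its source corpus, so two
    corpora that both contain an utterance "0001" no longer collide.
    """
    merged: list[Utterance] = []
    seen: set[str] = set()
    for name, utterances in corpora.items():
        for utt in utterances:
            new_id = f"{name}/{utt.utt_id}"
            if new_id in seen:
                raise ValueError(f"duplicate utterance id: {new_id}")
            seen.add(new_id)
            # Strip stray whitespace so transcripts are stored consistently.
            merged.append(Utterance(new_id, utt.audio_path, utt.transcript.strip()))
    return merged


# Illustrative input only: these corpora and paths are made up.
corpora = {
    "corpus_a": [Utterance("0001", "a/0001.wav", "guten Morgen ")],
    "corpus_b": [Utterance("0001", "b/0001.wav", "vielen Dank")],
}
merged = merge_corpora(corpora)
for utt in merged:
    print(utt.utt_id, utt.audio_path, utt.transcript)
```

A real merge would typically also need to reconcile audio formats, sampling rates, and train/test splits across sources, which this sketch leaves out.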
No commits in the last 6 months.
Use this if you need a comprehensive, pre-processed collection of German speech data to train or benchmark an ASR system.
Not ideal if you are looking for real-time speech processing tools or an off-the-shelf German ASR model rather than the underlying training data.
Stars: 34
Forks: 2
Language: Jupyter Notebook
License: MIT
Category:
Last pushed: May 03, 2024
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/german-asr/megs"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
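The same request can be made from Python. The sketch below uses only the standard library and rests on two assumptions not stated on this page: that the endpoint returns JSON, and that the URL path follows a category/owner/repository pattern (inferred from the single example URL above). The helper names `build_url` and `fetch_quality` are hypothetical.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example; the segment layout after it
# (category/owner/repo) is an assumption inferred from that one URL.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_url(category: str, owner: str, repo: str) -> str:
    """Build the request URL, percent-encoding each path segment."""
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return f"{BASE_URL}/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the record for one repository.

    Assumes the response body is JSON; json.loads raises an error
    if the server returns something else.
    """
    request = urllib.request.Request(
        build_url(category, owner, repo),
        headers={"Accept": "application/json"},
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


if __name__ == "__main__":
    # Printing the URL needs no network access; call fetch_quality(...)
    # with the same arguments to perform the actual request.
    print(build_url("voice-ai", "german-asr", "megs"))
```

Because keyless access is limited to 100 requests/day, it may be worth caching responses locally rather than re-fetching the same repository repeatedly.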
Higher-rated alternatives
ynop/audiomate
Python library for handling audio datasets.
reazon-research/ReazonSpeech
Massive open Japanese speech corpus
common-voice/cv-dataset
Metadata and versioning details for the Common Voice dataset
davidmartinrius/speech-dataset-generator
🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset...
EgorLakomkin/KTSpeechCrawler
Automatically constructing corpus for automatic speech recognition from YouTube videos