german-asr/megs

A merged version of multiple open-source German speech datasets.

/ 100

Experimental

Building accurate German Automatic Speech Recognition (ASR) models requires a large amount of spoken German audio and corresponding transcripts. This project gathers several open-source German speech datasets and combines them into one unified, ready-to-use corpus. It provides organized audio files and text transcripts, which are crucial for training and evaluating ASR systems. This is for researchers and developers working on German language technology.

No commits in the last 6 months.

Use this if you need a comprehensive, pre-processed collection of German speech data to train or benchmark an Automatic Speech Recognition (ASR) system.

Not ideal if you are looking for real-time speech processing tools or an off-the-shelf German ASR model rather than the underlying training data.

Automatic Speech Recognition German Language Processing Speech Technology Machine Learning Datasets Natural Language Processing

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 7 / 25

Maturity 16 / 25

Community 6 / 25

How are scores calculated?

Stars

Forks

Language

Jupyter Notebook

License

MIT

Higher-rated alternatives

ynop/audiomate

Python library for handling audio datasets.

reazon-research/ReazonSpeech

Massive open Japanese speech corpus

common-voice/cv-dataset

Metadata and versioning details for the Common Voice dataset

davidmartinrius/speech-dataset-generator

🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset...

EgorLakomkin/KTSpeechCrawler

Automatically constructing corpus for automatic speech recognition from YouTube videos

Explore Voice AI Tools

All categories Trending Voice AI directory Insights