ynop/audiomate

Python library for handling audio datasets.

Established

This library helps researchers and machine learning engineers work with audio datasets. It gives programmatic access to a range of audio collections and handles loading and managing them, which simplifies preparing audio data for tasks such as training speech recognition models. It takes raw audio files and their associated labels as input and produces a structured dataset ready for analysis or model input.

138 stars. No commits in the last 6 months. Available on PyPI.

Use this if you are an audio researcher or ML engineer who needs to efficiently work with diverse audio datasets, including downloading, loading, and performing operations like splitting or merging.

Not ideal if you are a casual user looking for a simple audio player or editor, as this tool is focused on programmatic data management for advanced analytical tasks.
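To make the "splitting or merging" operations mentioned above concrete, here is a minimal sketch in plain Python using only the standard library. It deliberately does not use audiomate's own API: the `Utterance` class, the `split_dataset` and `merge_datasets` helpers, and the file names are all hypothetical and exist only to illustrate what merging two labeled audio collections and splitting the result into train and test subsets can look like.

```python
import random
from dataclasses import dataclass


@dataclass(frozen=True)
class Utterance:
    """One labeled audio clip (hypothetical structure, not audiomate's class)."""
    audio_path: str
    label: str


def split_dataset(utterances, train_fraction=0.8, seed=0):
    """Shuffle deterministically, then split into (train, test) lists."""
    shuffled = list(utterances)
    random.Random(seed).shuffle(shuffled)
    cut = int(len(shuffled) * train_fraction)
    return shuffled[:cut], shuffled[cut:]


def merge_datasets(*datasets):
    """Concatenate datasets, skipping clips whose audio path was already seen."""
    seen = set()
    merged = []
    for dataset in datasets:
        for utt in dataset:
            if utt.audio_path not in seen:
                seen.add(utt.audio_path)
                merged.append(utt)
    return merged


# Made-up file names and labels, for illustration only.
corpus_a = [Utterance(f"a/clip_{i}.wav", "dog") for i in range(6)]
corpus_b = [Utterance(f"b/clip_{i}.wav", "rain") for i in range(4)]

combined = merge_datasets(corpus_a, corpus_b)
train, test = split_dataset(combined, train_fraction=0.8, seed=42)
print(len(combined), len(train), len(test))
```

A fixed seed keeps the split reproducible across runs, which matters when comparing models trained on the same data. For real work, the library's own documentation is the place to look for its corpus loading, subset, and merge facilities.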

audio-processing speech-recognition machine-learning-data-prep sound-classification digital-humanities

Stale 6m

Maintenance 0 / 25

Adoption 10 / 25

Maturity 25 / 25

Community 19 / 25

Stars 138

Forks

Language Python

License MIT

Related tools

reazon-research/ReazonSpeech

Massive open Japanese speech corpus

common-voice/cv-dataset

Metadata and versioning details for the Common Voice dataset

davidmartinrius/speech-dataset-generator

🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset...

EgorLakomkin/KTSpeechCrawler

Automatically constructing corpus for automatic speech recognition from YouTube videos

coqui-ai/open-speech-corpora

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
