ynop/audiomate
Python library for handling audio datasets.
This tool helps researchers and machine learning engineers streamline their work with audio datasets. It lets you easily access, load, and manage various audio collections, making it simpler to prepare audio data for tasks like training speech recognition models. You can pull in raw audio files and their associated labels, and the output is a structured dataset ready for analysis or model input.
138 stars. No commits in the last 6 months. Available on PyPI.
Use this if you are an audio researcher or ML engineer who needs to efficiently work with diverse audio datasets, including downloading, loading, and performing operations like splitting or merging.
Not ideal if you are a casual user looking for a simple audio player or editor, as this tool is focused on programmatic data management for advanced analytical tasks.
Stars
138
Forks
25
Language
Python
License
MIT
Category
Last pushed
Jul 06, 2023
Commits (30d)
0
Dependencies
11
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/ynop/audiomate"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
reazon-research/ReazonSpeech
Massive open Japanese speech corpus
common-voice/cv-dataset
Metadata and versioning details for the Common Voice dataset
davidmartinrius/speech-dataset-generator
🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset...
EgorLakomkin/KTSpeechCrawler
Automatically constructing corpus for automatic speech recognition from YouTube videos
coqui-ai/open-speech-corpora
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies