audeering/audformat
Format to store media files and annotations
This tool provides a standardized way to store collections of media files like audio, video, or text alongside their detailed annotations. It helps machine learning engineers or researchers organize their datasets, making it straightforward to swap or combine different datasets for model training and evaluation. You input raw media and annotation files, and it outputs a consistently structured database ready for use in machine learning projects.
Used by 2 other packages. Available on PyPI.
Use this if you are a machine learning engineer or researcher working with media data and need a reliable, standardized format to manage your datasets and ensure easy integration across different projects.
Not ideal if you are looking for a media player, an annotation tool, or a general-purpose database system for non-ML data.
Stars: 12
Forks: 1
Language: Python
License: —
Category:
Last pushed: Mar 19, 2026
Commits (30d): 0
Dependencies: 8
Reverse dependents: 2
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/audeering/audformat"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
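The curl request above can also be made from Python using only the standard library. This is a minimal sketch, not an official client: the helper names `build_quality_url` and `fetch_quality` are hypothetical, the `category/owner/repo` path layout is inferred from the single example URL, and the response body is returned as raw text because its format is not documented here.

```python
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example; everything below it is an assumption.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL, assuming a category/owner/repo path layout."""
    # Percent-encode each segment so unusual characters cannot break the path.
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0) -> str:
    """Fetch the quality record and return the response body as text.

    No API key is sent, so the request counts against the keyless daily limit.
    """
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        # Fall back to UTF-8 when the server does not declare a charset.
        charset = response.headers.get_content_charset() or "utf-8"
        return response.read().decode(charset)
```

Calling `fetch_quality("ml-frameworks", "audeering", "audformat")` requests the same URL as the curl command. Since keyless access is limited to 100 requests per day, it may be worth caching the result locally rather than fetching on every run.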
Related frameworks
iver56/audiomentations
A Python library for audio data augmentation. Useful for making audio ML models work well in the...
Rikorose/DeepFilterNet
Noise suppression using deep filtering
torchsynth/torchsynth
A GPU-optional modular synthesizer in pytorch, 16200x faster than realtime, for audio ML researchers.
marl/openl3
OpenL3: Open-source deep audio and image embeddings
archinetai/audio-data-pytorch
A collection of useful audio datasets and transforms for PyTorch.