audeering/audformat

Format to store media files and annotations

/ 100

Established

This tool provides a standardized way to store collections of media files like audio, video, or text alongside their detailed annotations. It helps machine learning engineers or researchers organize their datasets, making it straightforward to swap or combine different datasets for model training and evaluation. You input raw media and annotation files, and it outputs a consistently structured database ready for use in machine learning projects.

Used by 2 other packages. Available on PyPI.

Use this if you are a machine learning engineer or researcher working with media data and need a reliable, standardized format to manage your datasets and ensure easy integration across different projects.

Not ideal if you are looking for a media player, an annotation tool, or a general-purpose database system for non-ML data.

machine-learning-datasets audio-analysis video-analysis multimodal-data research-data-management

Maintenance 13 / 25

Adoption 7 / 25

Maturity 25 / 25

Community 6 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

—

Related frameworks

iver56/audiomentations

A Python library for audio data augmentation. Useful for making audio ML models work well in the...

Rikorose/DeepFilterNet

Noise supression using deep filtering

torchsynth/torchsynth

A GPU-optional modular synthesizer in pytorch, 16200x faster than realtime, for audio ML researchers.

marl/openl3

OpenL3: Open-source deep audio and image embeddings

archinetai/audio-data-pytorch

A collection of useful audio datasets and transforms for PyTorch.

Explore ML Frameworks

All categories Trending ML Framework directory Insights