audeering/audformat

Format to store media files and annotations

51
/ 100
Established

This tool provides a standardized way to store collections of media files like audio, video, or text alongside their detailed annotations. It helps machine learning engineers or researchers organize their datasets, making it straightforward to swap or combine different datasets for model training and evaluation. You input raw media and annotation files, and it outputs a consistently structured database ready for use in machine learning projects.

Used by 2 other packages. Available on PyPI.

Use this if you are a machine learning engineer or researcher working with media data and need a reliable, standardized format to manage your datasets and ensure easy integration across different projects.

Not ideal if you are looking for a media player, an annotation tool, or a general-purpose database system for non-ML data.

machine-learning-datasets audio-analysis video-analysis multimodal-data research-data-management
Maintenance 13 / 25
Adoption 7 / 25
Maturity 25 / 25
Community 6 / 25

How are scores calculated?

Stars

12

Forks

1

Language

Python

License

Last pushed

Mar 19, 2026

Commits (30d)

0

Dependencies

8

Reverse dependents

2

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/audeering/audformat"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.