Oxen-AI/Oxen

Lightning fast data version control system for structured and unstructured machine learning datasets. We aim to make versioning datasets as easy as versioning code.

59
/ 100
Established

Oxen helps machine learning practitioners manage and version large datasets, just like they version code. It takes in various data types like images, video, audio, or tabular data and outputs a versioned, traceable dataset, allowing teams to collaborate on models with confidence in their data's history. This tool is for ML engineers, data scientists, and researchers.

1,117 stars. Actively maintained with 95 commits in the last 30 days.

Use this if you need to track changes, collaborate on, and efficiently manage massive, diverse datasets for your machine learning projects.

Not ideal if your datasets are small, static, or you only work with code versioning tools like Git.

machine-learning-engineering data-versioning dataset-management ml-operations data-science
No Package No Dependents
Maintenance 22 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 11 / 25

How are scores calculated?

Stars

1,117

Forks

23

Language

Rust

License

Apache-2.0

Last pushed

Mar 13, 2026

Commits (30d)

95

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/Oxen-AI/Oxen"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.