Oxen-AI/Oxen
Lightning fast data version control system for structured and unstructured machine learning datasets. We aim to make versioning datasets as easy as versioning code.
Oxen helps machine learning practitioners manage and version large datasets, just like they version code. It takes in various data types like images, video, audio, or tabular data and outputs a versioned, traceable dataset, allowing teams to collaborate on models with confidence in their data's history. This tool is for ML engineers, data scientists, and researchers.
1,117 stars. Actively maintained with 95 commits in the last 30 days.
Use this if you need to track changes, collaborate on, and efficiently manage massive, diverse datasets for your machine learning projects.
Not ideal if your datasets are small, static, or you only work with code versioning tools like Git.
Stars
1,117
Forks
23
Language
Rust
License
Apache-2.0
Category
Last pushed
Mar 13, 2026
Commits (30d)
95
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/Oxen-AI/Oxen"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related frameworks
EnzymeAD/Enzyme
High-performance automatic differentiation of LLVM and MLIR.
LaurentMazare/tch-rs
Rust bindings for the C++ api of PyTorch.
SunDoge/dlpark
A Rust Library for High-Performance Tensor Exchange with Python
TheMesocarp/koho
Full spectrum sheaf neural network over arbitrary CW complexes.
Photoroom/datago
A natively parallel dataloader for Python, written in Rust. Serving data at GB/s speeds, while...