Ml Experiment Tracking Data Engineering Tools

There are 11 ml experiment tracking tools tracked. 3 score above 50 (established tier). The highest-rated is mage-ai/mage-ai at 68/100 with 8,672 stars. 1 of the top 10 are actively maintained.

Get all 11 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=data-engineering&subcategory=ml-experiment-tracking&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

# Tool Score Tier
1 mage-ai/mage-ai

🧙 Build, run, and manage data pipelines for integrating and transforming data.

68
Established
2 vaexio/vaex

Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML,...

65
Established
3 alibaba/feathub

FeatHub - A stream-batch unified feature store for real-time machine learning

56
Established
4 mindsdb/dbt-mindsdb

dbt adapter for connecting to MindsDB

40
Emerging
5 kevin-hanselman/dud

A lightweight CLI tool for versioning data alongside source code and...

37
Emerging
6 Bread-Technologies/Bread-Dataset-Viewer

VS Code extension to easily view and handle large datasets. Look at...

36
Emerging
7 bytehub-ai/bytehub

ByteHub: making feature stores simple

32
Emerging
8 Paulescu/bytewax-hopsworks-example

Compute and store real-time features for crypto trading using Bytwax (stream...

31
Emerging
9 runprism/prism

Prism is the easiest way to develop, orchestrate, and execute data pipelines...

29
Experimental
10 guilycst/lazy-dvc

A serverless-style LFS alternative that uses GitHub Org membership as...

22
Experimental
11 felixmccuaig/flowbase

A declarative ML platform for tabular data that eliminates infrastructure...

21
Experimental

Comparisons in this category