Distributed Training Frameworks Data Engineering Tools

5 distributed training framework tools are tracked. 2 score above 50, which places them in the Established tier. The highest-rated is fugue-project/fugue at 64/100 with 2,142 stars.

Get all 5 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=data-engineering&subcategory=distributed-training-frameworks&limit=20"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
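The same request can be made from Python using only the standard library. This is a minimal sketch: the URL and query parameters are taken from the curl command above, while the helper names `build_url` and `fetch_projects` are hypothetical. The response schema is not described on this page, so the sketch returns the decoded JSON without assuming any field names.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Endpoint copied from the curl command above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(domain="data-engineering",
              subcategory="distributed-training-frameworks",
              limit=20):
    """Build the query URL with the same parameters as the curl command."""
    query = urlencode({
        "domain": domain,
        "subcategory": subcategory,
        "limit": limit,
    })
    return f"{BASE_URL}?{query}"


def fetch_projects(url, timeout=10):
    """Fetch the URL and decode the JSON body.

    The response shape is an unknown here, so the decoded value is
    returned as-is rather than indexed by assumed keys.
    """
    with urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


# Example usage (performs a network request and counts against the daily limit):
#   data = fetch_projects(build_url())
#   print(json.dumps(data, indent=2))
```

Unauthenticated calls share the 100 requests/day allowance, so it may be worth caching the result locally rather than fetching on every run.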

#  Tool                         Score  Tier
1  fugue-project/fugue          64     Established
   A unified interface for distributed computing. Fugue executes SQL, Python,...
2  heavyai/heavydb              54     Established
   HeavyDB (formerly MapD/OmniSciDB)
3  BlazingDB/blazingsql         46     Emerging
   BlazingSQL is a lightweight, GPU accelerated, SQL engine for Python. Built...
4  intel/hdk                    41     Emerging
   A low-level execution library for analytic data processing.
5  PatrickPontes44/tiny-panda   21     Experimental
   tiny-panda is a lightweight JavaScript library inspired by Python’s pandas....
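The summary figures at the top of the page can be recomputed from the ranking itself. The sketch below hard-codes the five names, scores, and tiers exactly as listed above; the `summarize` helper is hypothetical, and the threshold of 50 comes from the page's own "score above 50" wording. The cut-offs between the other tiers are not stated, so the code does not try to derive tiers from scores.

```python
# (project, score, tier) copied from the ranking above.
ROWS = [
    ("fugue-project/fugue", 64, "Established"),
    ("heavyai/heavydb", 54, "Established"),
    ("BlazingDB/blazingsql", 46, "Emerging"),
    ("intel/hdk", 41, "Emerging"),
    ("PatrickPontes44/tiny-panda", 21, "Experimental"),
]


def summarize(rows, threshold=50):
    """Return (total tracked, count scoring above threshold, top-scoring row)."""
    above = [row for row in rows if row[1] > threshold]
    top = max(rows, key=lambda row: row[1])
    return len(rows), len(above), top


total, above_threshold, top = summarize(ROWS)
print(f"{total} tracked, {above_threshold} above 50, top: {top[0]} ({top[1]}/100)")
```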