rkhan055/SHADE
SHADE: Enable Fundamental Cacheability for Distributed Deep Learning Training
SHADE helps machine learning engineers and researchers accelerate deep learning training on large datasets spread across multiple machines. It identifies and caches the most important data samples during distributed training, cutting repeated fetches from remote storage. You supply your existing deep learning model and dataset; the output is reduced training time.
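The core idea above — keep the most important samples resident and evict the rest — can be sketched as a small importance-aware cache. This is a minimal illustration of the concept, not SHADE's actual implementation; the class name, scoring, and eviction policy here are assumptions (SHADE's paper describes a more sophisticated, rank-based scheme):

```python
class ImportanceCache:
    """Hypothetical sketch of an importance-aware sample cache:
    retain the samples with the highest importance scores (for
    example, per-sample training loss) and evict the least
    important cached sample when the cache is full."""

    def __init__(self, capacity):
        self.capacity = capacity
        self.samples = {}     # sample_id -> data
        self.importance = {}  # sample_id -> score

    def put(self, sample_id, data, score):
        if sample_id in self.samples:
            # Refresh data and score for an already-cached sample.
            self.samples[sample_id] = data
            self.importance[sample_id] = score
            return
        if len(self.samples) >= self.capacity:
            # Find the least-important cached sample.
            victim = min(self.importance, key=self.importance.get)
            if score <= self.importance[victim]:
                return  # new sample is no more important; skip caching
            del self.samples[victim]
            del self.importance[victim]
        self.samples[sample_id] = data
        self.importance[sample_id] = score

    def get(self, sample_id):
        # None signals a cache miss: fetch from backing storage.
        return self.samples.get(sample_id)
```

In a training loop, `put` would be called after each forward pass with the sample's loss as its score, so the cache gradually converges on the hardest (most informative) samples.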
No commits in the last 6 months.
Use this if you are a machine learning engineer or researcher experiencing slow deep learning model training due to data fetching bottlenecks in a distributed computing environment.
Not ideal if you are working with small datasets or training models on a single machine where data caching is less critical for performance.
Stars: 36
Forks: 9
Language: Python
License: MIT
Category:
Last pushed: Mar 01, 2023
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/rkhan055/SHADE"
Open to everyone: 100 requests/day with no key. A free key raises the limit to 1,000/day.
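The curl command above can be reproduced from Python using only the standard library. This is a sketch under stated assumptions: only the SHADE URL is confirmed by this listing, the path layout for other repositories is inferred from that example, and the JSON response schema is not documented here:

```python
import json
from urllib.request import urlopen

API_BASE = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category, owner, repo):
    """Build the endpoint URL for a repository.
    Path layout is inferred from the single example above."""
    return f"{API_BASE}/{category}/{owner}/{repo}"


def fetch_quality(category, owner, repo):
    """GET the endpoint and parse the JSON body.
    The response schema is undocumented here, so the parsed
    object is returned as-is for the caller to inspect."""
    with urlopen(quality_url(category, owner, repo)) as resp:
        return json.load(resp)


if __name__ == "__main__":
    # Same request as the curl example (counts against the
    # 100 requests/day anonymous quota).
    print(fetch_quality("ml-frameworks", "rkhan055", "SHADE"))
```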
Higher-rated alternatives
deepspeedai/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference...
helmholtz-analytics/heat
Distributed tensors and Machine Learning framework with GPU and MPI acceleration in Python
hpcaitech/ColossalAI
Making large AI models cheaper, faster and more accessible
horovod/horovod
Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
bsc-wdc/dislib
The Distributed Computing library for python implemented using PyCOMPSs programming model for HPC.