hegongshan/Storage-for-AI-Paper

Accelerating AI Training and Inference from Storage Perspective (Must-read Papers on Storage for AI)

/ 100

Emerging

This project offers a curated collection of research papers focused on optimizing storage systems for AI and deep learning workloads. It helps AI/ML engineers and researchers understand how different storage formats, systems, caching, and data preprocessing techniques impact the speed and efficiency of training and inference. You'll find papers addressing common bottlenecks and solutions in handling large datasets for AI.

Use this if you are an AI/ML engineer or researcher looking for academic literature and practical solutions to improve the performance of your AI models by addressing data storage and loading inefficiencies.

Not ideal if you are looking for an out-of-the-box software tool or library to directly implement, as this is a collection of research papers rather than a deployable solution.

AI-infrastructure deep-learning-optimization data-pipeline-performance ML-engineering storage-architecture

No License No Package No Dependents

Maintenance 10 / 25

Adoption 8 / 25

Maturity 8 / 25

Community 11 / 25

How are scores calculated?

Stars

Forks

Language

—

License

—

Higher-rated alternatives

deepspeedai/DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference...

helmholtz-analytics/heat

Distributed tensors and Machine Learning framework with GPU and MPI acceleration in Python

hpcaitech/ColossalAI

Making large AI models cheaper, faster and more accessible

horovod/horovod

Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.

bsc-wdc/dislib

The Distributed Computing library for python implemented using PyCOMPSs programming model for HPC.

Explore ML Frameworks

All categories Trending ML Framework directory Insights