deepspeedai/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
DeepSpeed is a deep learning optimization library that helps machine learning engineers and researchers train very large AI models, such as large language models, more efficiently. It takes your existing deep learning model and training script and outputs a faster, more scalable training process, often enabling models that would otherwise be too big to train. It's designed for practitioners who need to push the boundaries of model size and performance.
41,801 stars. Used by 25 other packages. Actively maintained with 30 commits in the last 30 days. Available on PyPI.
Use this if you are training large-scale deep learning models and facing challenges with memory limits, slow training times, or difficulty scaling across multiple GPUs or machines.
Not ideal if you are working with small to medium-sized models or are not comfortable with distributed training concepts and configurations.
Stars
41,801
Forks
4,751
Language
Python
License
Apache-2.0
Category
Last pushed
Mar 13, 2026
Commits (30d)
30
Dependencies
11
Reverse dependents
25
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/deepspeedai/DeepSpeed"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Recent Releases
Related frameworks
helmholtz-analytics/heat
Distributed tensors and Machine Learning framework with GPU and MPI acceleration in Python
hpcaitech/ColossalAI
Making large AI models cheaper, faster and more accessible
horovod/horovod
Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
bsc-wdc/dislib
The Distributed Computing library for python implemented using PyCOMPSs programming model for HPC.
learning-at-home/hivemind
Decentralized deep learning in PyTorch. Built to train models on thousands of volunteers across...