xorbitsai/xorbits
Scalable Python DS & ML, in an API compatible & lightning fast way.
When your data science and machine learning tasks involve large datasets that strain your computer's memory or take too long to process, Xorbits helps. It takes your existing data analysis code, often written in Python with libraries like pandas or NumPy, and allows it to run much faster and handle much larger amounts of data without significant code changes. This is ideal for data scientists, machine learning engineers, and analysts who need to scale their workflows from a single machine to a powerful cluster.
1,203 stars. Available on PyPI.
Use this if you are a data scientist or ML practitioner working with Python and need to speed up your data processing or analyze datasets that are too large for your current setup, without rewriting your entire codebase.
Not ideal if your datasets are small and your existing Python scripts already run efficiently on your local machine.
Stars
1,203
Forks
70
Language
Python
License
Apache-2.0
Category
Last pushed
Feb 14, 2026
Commits (30d)
0
Dependencies
15
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/xorbitsai/xorbits"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related frameworks
deepspeedai/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference...
helmholtz-analytics/heat
Distributed tensors and Machine Learning framework with GPU and MPI acceleration in Python
hpcaitech/ColossalAI
Making large AI models cheaper, faster and more accessible
horovod/horovod
Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
bsc-wdc/dislib
The Distributed Computing library for python implemented using PyCOMPSs programming model for HPC.