cylondata/cylon
Cylon is a fast, scalable, distributed-memory parallel runtime with a Pandas-like DataFrame.
Cylon helps data and AI/ML engineers process large, structured datasets quickly. It takes existing tabular data (such as tables or spreadsheets), applies common operations such as combining, filtering, or sorting, and outputs the transformed data. It suits people who work with large volumes of data and need to transform it efficiently.
302 stars.
Use this if you need to perform fast, scalable data processing on large, structured datasets across multiple machines.
Not ideal if you are working with small datasets that can be processed quickly on a single machine or if you do not have distributed computing resources.
Stars: 302
Forks: 46
Language: Jupyter Notebook
License: Apache-2.0
Category:
Last pushed: Mar 12, 2026
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/cylondata/cylon"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
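The same request can be made from a script. The following is a minimal Python sketch using only the standard library, not an official client. It assumes the path follows the shape of the curl example (category, then owner, then repository) and that the endpoint returns a JSON body; neither is documented on this page. The names `quality_url` and `fetch_quality` are hypothetical helpers introduced for illustration.

```python
import json
import urllib.request

# Base path copied from the curl example on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL.

    Assumption: the path is <category>/<owner>/<repo>, inferred from the
    single curl example; other categories or repositories may differ.
    """
    return f"{BASE_URL}/{category}/{owner}/{repo}"


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the record for one repository without an API key.

    Assumption: the response body is JSON. If it is not, json.loads
    raises a ValueError and the raw body should be inspected instead.
    """
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


# Usage (performs a network request, counted against the daily limit):
#   data = fetch_quality("ml-frameworks", "cylondata", "cylon")
#   print(data)
```

Keeping URL construction separate from the network call makes it possible to check the URL offline and to reuse the helper for the related frameworks listed below, assuming they are served under the same category.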
Related frameworks
deepspeedai/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference...
helmholtz-analytics/heat
Distributed tensors and Machine Learning framework with GPU and MPI acceleration in Python
hpcaitech/ColossalAI
Making large AI models cheaper, faster and more accessible
horovod/horovod
Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
bsc-wdc/dislib
The distributed computing library for Python, implemented using the PyCOMPSs programming model for HPC.