cylondata/cylon

Cylon is a fast, scalable, distributed-memory parallel runtime with a pandas-like DataFrame.

Established

Cylon helps data and AI/ML engineers process large, structured datasets quickly. It takes existing tabular data (such as tables or spreadsheets), applies common operations such as combining, filtering, and sorting, and outputs the transformed data. It suits teams that need to transform large volumes of data efficiently.

302 stars.

Use this if you need to perform fast, scalable data processing on large, structured datasets across multiple machines.

Not ideal if you are working with small datasets that can be processed quickly on a single machine or if you do not have distributed computing resources.

data-engineering machine-learning-engineering big-data-processing distributed-computing data-transformation

No Package

No Dependents

Maintenance 10 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 20 / 25

Stars 302

Forks

Language Jupyter Notebook

License Apache-2.0

Related frameworks

deepspeedai/DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference...

helmholtz-analytics/heat

Distributed tensors and Machine Learning framework with GPU and MPI acceleration in Python

hpcaitech/ColossalAI

Making large AI models cheaper, faster and more accessible

horovod/horovod

Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.

bsc-wdc/dislib

The Distributed Computing library for Python, implemented using the PyCOMPSs programming model for HPC.
