cylondata/cylon

Cylon is a fast, scalable, distributed-memory parallel runtime with a Pandas-like DataFrame.

56 / 100
Established

Cylon helps data and AI/ML engineers process large, structured datasets quickly. It takes existing tabular data (such as tables or spreadsheets), applies common operations such as combining, filtering, or sorting, and outputs the transformed data. It suits workloads where large volumes of data must be transformed efficiently.

302 stars.

Use this if you need to perform fast, scalable data processing on large, structured datasets across multiple machines.

Not ideal if you are working with small datasets that can be processed quickly on a single machine or if you do not have distributed computing resources.

data-engineering machine-learning-engineering big-data-processing distributed-computing data-transformation
No Package
No Dependents
Maintenance 10 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 20 / 25
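
The four category scores above add up to the overall score shown at the top of the page (10 + 10 + 16 + 20 = 56). A minimal Python sketch of that arithmetic follows. It assumes the overall score is a plain sum of the four categories, which matches the numbers here but is not stated on the page; the name `category_scores` is hypothetical.

```python
# Category scores as listed on the page, each out of 25.
category_scores = {
    "Maintenance": 10,
    "Adoption": 10,
    "Maturity": 16,
    "Community": 20,
}

# Assumption: the overall score is the plain sum of the four categories.
overall = sum(category_scores.values())
max_overall = 25 * len(category_scores)

print(f"{overall} / {max_overall}")
```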


Stars: 302
Forks: 46
Language: Jupyter Notebook
License: Apache-2.0
Last pushed: Mar 12, 2026
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/cylondata/cylon"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
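
The same request can be made from Python. The sketch below uses only the standard library. The function names `quality_url` and `fetch_quality` are hypothetical, the category/owner/repo path layout is inferred from the curl command above, and the endpoint is assumed to return a JSON body, which the page does not confirm.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl command on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL in the same shape as the curl example."""
    parts = [quote(p, safe="") for p in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the record and decode it (assumption: the body is JSON)."""
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


# Building the URL needs no network access.
print(quality_url("ml-frameworks", "cylondata", "cylon"))
```

Calling `fetch_quality("ml-frameworks", "cylondata", "cylon")` performs the actual request; each call counts against the daily request limit described above.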