paypal/gators
Gators is a Python-only package for building models on large datasets and for fast real-time pre-processing, even at a high number of queries per second (QPS).
This tool helps data scientists and machine learning engineers rapidly prepare large datasets for model building. It takes raw, messy data with various types (numeric, categorical, text, dates) and transforms it into clean, structured features suitable for training predictive models. The output is a refined dataset, ready for machine learning, even for applications requiring fast, real-time processing.
No commits in the last 6 months.
Use this if you need to quickly and reliably clean, transform, and generate features from large datasets for machine learning models, especially for high-throughput systems like fraud detection or risk modeling.
Not ideal if your data preparation needs are minimal, your datasets are small, or you primarily work with simple, predefined transformations that don't require extensive feature engineering.
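The kind of preparation described above can be illustrated with a minimal, standard-library-only sketch. It does not use the Gators API, which this page does not document; the records, field names, and the `prepare_features` helper are all hypothetical. It shows three common steps under those assumptions: filling a missing numeric value with the column mean, one-hot encoding a categorical field, and expanding a date into numeric parts.

```python
from datetime import date
from statistics import mean

# Hypothetical raw records: a numeric field with a gap, a categorical
# field, and a date stored as an ISO string.
raw_rows = [
    {"amount": 120.0, "channel": "web", "signup": "2024-01-15"},
    {"amount": None, "channel": "app", "signup": "2024-03-02"},
    {"amount": 80.0, "channel": "web", "signup": "2024-03-30"},
]


def prepare_features(rows):
    """Return one all-numeric feature dict per input row."""
    # Mean of the observed amounts, used to fill missing values.
    known_amounts = [r["amount"] for r in rows if r["amount"] is not None]
    fill_value = mean(known_amounts)

    # Fixed, sorted category list so every row gets the same columns.
    channels = sorted({r["channel"] for r in rows})

    features = []
    for r in rows:
        signup = date.fromisoformat(r["signup"])
        out = {
            "amount": r["amount"] if r["amount"] is not None else fill_value,
            "signup_month": signup.month,
            "signup_weekday": signup.weekday(),  # Monday is 0
        }
        # One-hot encode the categorical field.
        for c in channels:
            out[f"channel_{c}"] = 1 if r["channel"] == c else 0
        features.append(out)
    return features


features = prepare_features(raw_rows)
```

In a real-time setting the fill value and category list would be learned once from training data and reused for each incoming record, rather than recomputed per batch as in this sketch.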
Stars: 9
Forks: 7
Language: HTML
License: Apache-2.0
Category:
Last pushed: Feb 13, 2025
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/paypal/gators"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
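The same request can be made from Python with only the standard library. This is a sketch under two assumptions that the page does not confirm: that the path follows a category/owner/repository pattern (inferred from the single URL above), and that the response body is JSON. The helper names `quality_url` and `fetch_quality` are hypothetical.

```python
import json
from urllib.request import urlopen

# Base path taken from the curl example on this page.
API_BASE = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL (path layout is an assumption)."""
    return f"{API_BASE}/{category}/{owner}/{repo}"


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """GET the endpoint and decode the body, assuming it is JSON."""
    with urlopen(quality_url(category, owner, repo), timeout=timeout) as resp:
        return json.load(resp)
```

Calling `fetch_quality("ml-frameworks", "paypal", "gators")` requests the same URL as the curl command. Unauthenticated calls count against the 100 requests/day limit.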
Higher-rated alternatives
deepspeedai/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference...
helmholtz-analytics/heat
Distributed tensors and Machine Learning framework with GPU and MPI acceleration in Python
hpcaitech/ColossalAI
Making large AI models cheaper, faster and more accessible
horovod/horovod
Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
bsc-wdc/dislib
The Distributed Computing library for python implemented using PyCOMPSs programming model for HPC.