firmai/pandapy
PandaPy has the speed of NumPy and the usability of Pandas 10x to 50x faster (by @firmai)
PandaPy helps finance professionals and data analysts process tabular data faster for tasks like calculating returns or portfolio values. It takes structured datasets, similar to a spreadsheet or database table, and rapidly performs calculations and transformations, outputting the results as a new structured dataset. Anyone working with smaller datasets (up to around 50,000 records) who needs high performance from their Python tools will find this useful.
549 stars. No commits in the last 6 months. Available on PyPI.
Use this if you need to rapidly process small to medium-sized datasets with mixed data types on a single machine, especially for financial calculations, and want the usability of Pandas with the speed of NumPy.
Not ideal if your datasets consistently exceed 500,000 records or you require distributed computing across multiple machines, as other tools are optimized for very large-scale data processing.
Stars
549
Forks
66
Language
Python
License
—
Category
Last pushed
Oct 20, 2021
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/firmai/pandapy"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
deepspeedai/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference...
helmholtz-analytics/heat
Distributed tensors and Machine Learning framework with GPU and MPI acceleration in Python
hpcaitech/ColossalAI
Making large AI models cheaper, faster and more accessible
horovod/horovod
Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
bsc-wdc/dislib
The Distributed Computing library for python implemented using PyCOMPSs programming model for HPC.