CubicZebra/informatics
Framework of fast implementation data processing and operating pipelines
This tool helps scientists, engineers, and analysts efficiently manage and process their data. It takes raw or structured data, allows for cleaning, transformation, and advanced analysis, and outputs dynamically visualized results or prepared data for model training and deployment. It's designed for anyone who needs to build custom, modular data pipelines.
585 stars. No commits in the last 6 months. Available on PyPI.
Use this if you need to systematically collect, manipulate, store, retrieve, or classify data within scientific, engineering, or analytical projects.
Not ideal if you are looking for a simple, out-of-the-box solution without the need for custom data pipeline construction.
Stars
585
Forks
17
Language
Python
License
Apache-2.0
Category
Last pushed
Sep 06, 2025
Commits (30d)
0
Dependencies
5
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/CubicZebra/informatics"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
treeverse/dvc
🦉 Data Versioning and ML Experiments
runpod/runpod-python
🐍 | Python library for RunPod API and serverless worker SDK.
microsoft/vscode-jupyter
VS Code Jupyter extension
4paradigm/OpenMLDB
OpenMLDB is an open-source machine learning database that provides a feature platform computing...
uber/petastorm
Petastorm library enables single machine or distributed training and evaluation of deep learning...