CubicZebra/informatics

Framework of fast implementation data processing and operating pipelines

/ 100

Emerging

This tool helps scientists, engineers, and analysts efficiently manage and process their data. It takes raw or structured data, allows for cleaning, transformation, and advanced analysis, and outputs dynamically visualized results or prepared data for model training and deployment. It's designed for anyone who needs to build custom, modular data pipelines.

585 stars. No commits in the last 6 months. Available on PyPI.

Use this if you need to systematically collect, manipulate, store, retrieve, or classify data within scientific, engineering, or analytical projects.

Not ideal if you are looking for a simple, out-of-the-box solution without the need for custom data pipeline construction.

data-analysis scientific-research engineering-workflows data-processing model-deployment

Stale 6m

Maintenance 2 / 25

Adoption 10 / 25

Maturity 25 / 25

Community 10 / 25

How are scores calculated?

Stars

585

Forks

Language

Python

License

Apache-2.0

Higher-rated alternatives

treeverse/dvc

🦉 Data Versioning and ML Experiments

runpod/runpod-python

🐍 | Python library for RunPod API and serverless worker SDK.

microsoft/vscode-jupyter

VS Code Jupyter extension

4paradigm/OpenMLDB

OpenMLDB is an open-source machine learning database that provides a feature platform computing...

uber/petastorm

Petastorm library enables single machine or distributed training and evaluation of deep learning...

Explore ML Frameworks

All categories Trending ML Framework directory Insights