datajoint/datajoint-python
Relational data pipelines for the science lab
DataJoint helps scientists build robust, reproducible workflows for their research experiments and analyses. You define each step of your scientific process, how data flows between them, and the computations involved. It takes your raw experimental data and processes it into structured, traceable results. This is ideal for scientists, lab managers, and research teams who need to manage complex, multi-step data processing.
191 stars. Available on PyPI.
Use this if you need to organize complex scientific data processing into repeatable steps with clear dependencies and guaranteed result traceability.
Not ideal if your data processing involves simple, one-off scripts without complex interdependencies or if you are not working with structured scientific data.
Stars
191
Forks
96
Language
Python
License
Apache-2.0
Category
Last pushed
Mar 13, 2026
Commits (30d)
0
Dependencies
10
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/data-engineering/datajoint/datajoint-python"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
PrefectHQ/prefect
Prefect is a workflow orchestration framework for building resilient data pipelines in Python.
growthbook/growthbook
Open Source Feature Flags, Experimentation, and Product Analytics
koopjs/koop
Transform, query, and download geospatial data on the web.
pathwaycom/pathway
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
dagster-io/dagster
An orchestration platform for the development, production, and observation of data assets.