mehd-io/pypi-duck-flow

end-to-end data engineering project to get insights from PyPi using python, duckdb, MotherDuck & Evidence

50
/ 100
Established

This project helps data professionals understand usage patterns for Python projects on PyPI. It processes raw PyPI download logs, cleans and transforms them into meaningful metrics, and then presents these insights in an interactive dashboard. Data engineers or analysts can use this to monitor their projects or analyze the Python package ecosystem.

234 stars.

Use this if you need to build an end-to-end pipeline to gather, process, and visualize data about PyPI package downloads, especially if you work with Python, SQL, and DuckDB.

Not ideal if you're looking for a simple, pre-built web service to query PyPI stats without setting up any data infrastructure.

data-engineering python-package-analytics data-pipelines data-visualization pypi-insights
No License No Package No Dependents
Maintenance 13 / 25
Adoption 10 / 25
Maturity 8 / 25
Community 19 / 25

How are scores calculated?

Stars

234

Forks

36

Language

TypeScript

License

Last pushed

Mar 16, 2026

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/data-engineering/mehd-io/pypi-duck-flow"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.