Edwardvaneechoud/Flowfile
Flowfile is a visual ETL tool and Python library combining drag-and-drop workflows with Polars dataframes. Build data pipelines visually, define flows programmatically with a Polars-like API, and export to standalone Python code. Perfect for fast, intuitive data processing from development to production.
This tool helps data analysts, business intelligence specialists, and operations engineers visually build and manage data cleaning and transformation pipelines. You start by connecting to raw data sources, dragging and dropping nodes to define transformations, and end with clean, standardized datasets or ready-to-deploy Python code. It's designed for users who need to quickly process large volumes of data without writing extensive code.
226 stars.
Use this if you need to quickly prepare, clean, and integrate data from various sources for analysis or reporting, especially when dealing with large datasets or messy Excel files.
Not ideal if you primarily work with small, static datasets that require only basic spreadsheet manipulations, or if your data pipelines already reside in a fully code-based environment.
Stars
226
Forks
17
Language
Python
License
MIT
Category
Last pushed
Mar 19, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/data-engineering/Edwardvaneechoud/Flowfile"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
PrefectHQ/prefect
Prefect is a workflow orchestration framework for building resilient data pipelines in Python.
growthbook/growthbook
Open Source Feature Flags, Experimentation, and Product Analytics
koopjs/koop
Transform, query, and download geospatial data on the web.
pathwaycom/pathway
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
dagster-io/dagster
An orchestration platform for the development, production, and observation of data assets.