MaxHalford/tuna
:fish: A streaming ETL for fish
This project helps process very large datasets by calculating running statistics on data that won't fit into your computer's memory. You input raw data, often from CSV files, and it outputs aggregated results like averages, sums, or variances for different categories. This is ideal for data analysts, business intelligence professionals, or researchers who need to analyze massive streaming or batch datasets efficiently.
No commits in the last 6 months.
Use this if you need to calculate aggregate statistics like means, sums, or variances on datasets that are too large to load entirely into memory.
Not ideal if your datasets are small enough to process with standard in-memory tools or if you need complex, multi-stage data transformations beyond aggregation.
Stars
13
Forks
1
Language
Go
License
MIT
Category
Last pushed
Jan 24, 2019
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/data-engineering/MaxHalford/tuna"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
rocketlaunchr/dataframe-go
DataFrames for Go: For statistics, machine-learning, and data manipulation/exploration
crackcell/hpipe
Workflow engine for various computing systems.
Xenios91/Byte-Chomp
A Golang tool for obtaining data on Golang binaries in csv format
yaricom/dfutils
The collection of utilities to work with QFrame implementation of the data frames in GO language...