gagolews/teaching-data
Prof. Marek's Data for Teaching/Training
This project provides a collection of diverse datasets designed for teaching and training in statistics and machine learning. Unlike overly simplistic examples, these datasets reflect real-world complexities, helping students and practitioners develop robust analytical skills. You get raw data files, and the output is a more realistic understanding of data analysis challenges. Anyone learning or teaching data science, statistics, or machine learning would find these useful.
No commits in the last 6 months.
Use this if you are a student, educator, or practitioner looking for challenging, realistic datasets to practice data analysis, statistical modeling, or machine learning, without cherry-picking 'easy' examples.
Not ideal if you are looking for simple, clean datasets that will always yield clear, 'interesting' stories or easy-to-model patterns.
Stars
29
Forks
52
Language
—
License
—
Category
Last pushed
Mar 14, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/gagolews/teaching-data"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
laresbernardo/lares
Analytics & Machine Learning R Sidekick
lucasmaystre/choix
Inference algorithms for models based on Luce's choice axiom
TheAlgorithms/R
Collection of various algorithms implemented in R.
easystats/performance
:muscle: Models' quality and performance metrics (R2, ICC, LOO, AIC, BF, ...)
mlr-org/mlr
Machine Learning in R