Statistics for Data Science ML Frameworks

Educational resources, textbooks, and comprehensive courses on probability, statistics, and statistical methods specifically for data science applications. Includes lecture notes, tutorials, and problem sets. Does NOT include general machine learning algorithms, deep learning frameworks, or discipline-specific statistics (e.g., biostatistics, econometrics).

There are 28 statistics-for-data-science frameworks tracked. 4 score 50 or higher (Established tier). The highest-rated is D2RS-2026spring/data-driven-reproducible-study at 52/100 with 18 stars. 1 of the top 10 is actively maintained.

Get all 28 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=ml-frameworks&subcategory=statistics-for-data-science&limit=20"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
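The curl request above can also be issued from Python. The following is a minimal sketch using only the standard library; the helper names `build_url` and `fetch_projects` are illustrative and not part of the API. It assumes the endpoint returns a JSON body, as the text states, and makes no assumption about the field names inside it. The query parameters are copied from the curl command, including `limit=20`; whether a larger `limit` is accepted is not documented here.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Endpoint taken from the curl command in this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(domain="ml-frameworks",
              subcategory="statistics-for-data-science",
              limit=20):
    """Build the request URL with the same query parameters as the curl call."""
    query = urlencode({"domain": domain, "subcategory": subcategory, "limit": limit})
    return f"{BASE_URL}?{query}"


def fetch_projects(url, timeout=30):
    """Fetch the URL and decode the body as JSON (hypothetical helper).

    The structure of the decoded object is not described in the source,
    so it is returned as-is for the caller to inspect.
    """
    with urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


# Example usage (performs a network request, so it is left commented out):
# data = fetch_projects(build_url())
# print(json.dumps(data, indent=2)[:500])
```

Printing the first few hundred characters of the decoded response is a quick way to see which fields are actually returned before writing code that depends on them.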

# Framework Score Tier
1 D2RS-2026spring/data-driven-reproducible-study

Lecture notes for the course "Data-Driven Reproducible Research"

52
Established
2 wangyingsm/Python-Data-Science-Handbook

A Chinese translation of Jake VanderPlas' "Python Data Science Handbook"....

51
Established
3 unpingco/Python-for-Probability-Statistics-and-Machine-Learning

Jupyter Notebooks for Springer book "Python for Probability, Statistics, and...

51
Established
4 cfgranda/ps4ds

Probability and Statistics for Data Science: A self-contained introduction...

50
Established
5 APMonitor/pds

Machine Learning for Engineers in Python

49
Emerging
6 aeturrell/python4DS

Python for Data Science. This repository hosts the code behind the online...

49
Emerging
7 matteocourthoud/Machine-Learning-for-Economic-Analysis

Material for the exercise sessions of master course Machine Learning for...

46
Emerging
8 verri/dsp-book

Data Science Project: An Inductive Learning Approach

43
Emerging
9 muandet-lab/ipml-course

A course on imprecise probabilistic machine learning

42
Emerging
10 rickiepark/python4daml

Code repository for the book "코딩 뇌를 깨우는 파이썬" (Hanbit Media, 2023)

39
Emerging
11 alioh/ds-100-ar

Arabic Translation of Data 100 Textbook at UC Berkeley http://www.textbook.ds100.org/

39
Emerging
12 UWNETLAB/dcss_supplementary

Supplementary materials for McLevey 2021 Doing Computational Social Science...

39
Emerging
13 tomasonjo/graphs-network-science

Accompanying repository for my book about Graph Data Science

37
Emerging
14 apachecn/ds100-textbook-zh

:book: [Translation] UCB DS100: Principles and Techniques of Data Science

36
Emerging
15 harrywang/misy331

Course Website for MISY331 Machine Learning for Business

35
Emerging
16 Chandrakant817/Statistics-for-Data-Science

Statistics for Data Science and Machine Learning Handwritten Notes

34
Emerging
17 jdestefani/StatisticalFoundationsML_INFOF422

Repository for the Statistical Foundation of Machine Learning class (INFO-F-422).

33
Emerging
18 AIML-research/ML4DS-Lecture

Machine Learning for Data Science lecture at Freie Universität Berlin during the winter semester 2021/22

22
Experimental
19 luqigroup/cap-4611

Course website for CAP 4611

17
Experimental
20 DiogoRibeiro7/academic-presentations

Professional-grade presentations on advanced statistics, MCMC methods, and...

15
Experimental
21 rugvedmhatre/machine-learning-summer

Course Website for ML Summer Course

15
Experimental
22 arushig02/Statistics-ML

Statistics for Machine Learning — Week 1

14
Experimental
23 Vinod123456183/DSMP-1.0

ML

14
Experimental
24 Gaurav-Van/Scripted_Insights-Handwritten_DS_ML_Notes

Dive into the world of Data Science and Machine Learning with meticulously...

13
Experimental
25 djunicode/shalizi-stats

Reading Group for Cosma Shalizi's Textbook on Advanced Data Analysis

12
Experimental
26 JosephMehdiyev/Statistics-and-Probability-with-Code-Applications

An open-source book written by Joseph Mehdiyev for educational and...

12
Experimental
27 ml-repo/ml-repo.github.io

Statistical Learning

11
Experimental
28 bartczernicki/UncoveringSportsInsights

Repository for "Uncovering Sports Insights with Machine Intelligence" live...

11
Experimental