jbrownlee/Datasets

Machine learning datasets used in tutorials on MachineLearningMastery.com

43 / 100 Emerging

This collection provides a stable source for various datasets commonly used in machine learning exercises. It offers clean, pre-formatted CSV files for tasks like predicting outcomes from medical records or financial data, classifying images, forecasting sales, or analyzing text. Data scientists, students, and practitioners learning or experimenting with machine learning models will find these useful for practice and validating algorithms.

1,224 stars. No commits in the last 6 months.

Use this if you need reliable, pre-processed datasets for training and testing machine learning models in classification, regression, or time series analysis.

Not ideal if you require real-time data feeds, highly specialized domain-specific datasets not listed, or if your primary need is for raw, unprocessed data for feature engineering practice.
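
As a rough illustration of the classification and regression use case, the sketch below parses one of the collection's CSV files into feature rows and a label column using only the Python standard library. It assumes the file has no header row, that the last column is the target, and that every feature is numeric. None of this is stated on this page, so check the specific file first. The helper names `load_headerless_csv` and `fetch_text` are hypothetical.

```python
import csv
import io
import urllib.request


def load_headerless_csv(text):
    """Parse header-less CSV text into feature rows X and a label column y.

    Assumption: the last column is the target and all other columns
    are numeric. Verify this for the file you actually download.
    """
    X, y = [], []
    for row in csv.reader(io.StringIO(text)):
        if not row:  # skip blank lines
            continue
        X.append([float(value) for value in row[:-1]])
        y.append(row[-1])  # labels are kept as strings
    return X, y


def fetch_text(url):
    """Download a URL and decode the body as UTF-8 text."""
    with urllib.request.urlopen(url) as resp:
        return resp.read().decode("utf-8")
```

To use it, pass the raw URL of a file from the repository to `fetch_text` and hand the result to `load_headerless_csv`. The exact file names and branch are not listed here, so look them up in the repository itself.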

predictive-modeling data-science-education statistical-analysis time-series-forecasting natural-language-processing
No License Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 8 / 25
Community 25 / 25


Stars 1,224
Forks 1,497
Language (none listed)
License (none listed)
Last pushed Aug 15, 2023
Commits (30d) 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/jbrownlee/Datasets"

Open to everyone: 100 requests/day with no key needed. Get a free key for 1,000/day.
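
The same request can be made from Python. The sketch below is a minimal version built on two assumptions: that the path follows a `quality/<category>/<owner>/<repo>` pattern (inferred from the single URL above), and that the endpoint returns JSON (the response format is not documented here). The function names are hypothetical.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example; the segment layout is an assumption.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category, owner, repo):
    """Build the endpoint URL, percent-encoding each path segment."""
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category, owner, repo, timeout=10):
    """Fetch the record for one repository.

    Assumption: the response body is JSON. If it is not, json.loads
    raises a ValueError.
    """
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

Calling `fetch_quality("ml-frameworks", "jbrownlee", "Datasets")` requests the URL shown in the curl command. Unauthenticated use counts against the 100 requests/day limit.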