explosion/ml-datasets

🌊 Machine learning dataset loaders for testing and example scripts

74
/ 100
Verified

This tool helps developers quickly access standard machine learning datasets to build and test their natural language processing (NLP) or image recognition models. It provides readily available text data for tasks like sentiment analysis, question answering, or image data for recognition, giving developers the necessary input to train and evaluate their algorithms.

47 stars and 10,308 monthly downloads. Used by 1 other package. Available on PyPI.

Use this if you are a developer building or testing machine learning models and need convenient access to well-known, pre-structured datasets for tasks like text classification or image recognition.

Not ideal if you are a non-developer seeking an out-of-the-box solution to analyze your own specific data or a tool for general data management.

natural-language-processing image-recognition text-classification sentiment-analysis machine-learning-development
Maintenance 13 / 25
Adoption 18 / 25
Maturity 25 / 25
Community 18 / 25

How are scores calculated?

Stars

47

Forks

16

Language

Python

License

MIT

Last pushed

Mar 26, 2026

Monthly downloads

10,308

Commits (30d)

0

Dependencies

5

Reverse dependents

1

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/explosion/ml-datasets"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.