unsplash/datasets
🎁 6,500,000+ Unsplash images made available for research and machine learning
This dataset provides a vast collection of high-quality Unsplash photos, along with associated keywords and search data. It's designed for researchers and machine learning practitioners who need a rich source of image data for training models or conducting studies. You get millions of images, keywords, and search queries, which can be used to understand visual content and user intent.
2,680 stars. No commits in the last 6 months.
Use this if you need a large, diverse dataset of images and associated text for academic research or developing machine learning models related to image recognition, content understanding, or search relevance.
Not ideal if you need images for commercial products or applications, as the full dataset is strictly for non-commercial research, and redistribution of images is prohibited.
Stars
2,680
Forks
135
Language
Jupyter Notebook
License
—
Category
Last pushed
Apr 17, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/unsplash/datasets"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
open-edge-platform/datumaro
Dataset Management Framework, a Python library and a CLI tool to build, analyze and manage...
explosion/ml-datasets
🌊 Machine learning dataset loaders for testing and example scripts
webdataset/webdataset
A high-performance Python-based I/O system for large (and small) deep learning problems, with...
tensorflow/datasets
TFDS is a collection of datasets ready to use with TensorFlow, Jax, ...
mlcommons/croissant
Croissant is a high-level format for machine learning datasets that brings together four rich layers.