asampat3090/open-datasets
Running list of Open Datasets
This is a curated and organized list of publicly available datasets to help you find the data you need for your research or analysis. It provides direct links to various datasets, categorized by topic, indicating whether they are single or collections, and if they are free, paid, or require credentials. Scientists, researchers, data analysts, or students looking for specific data for projects in fields like biology, agriculture, or climate studies would find this useful.
No commits in the last 6 months.
Use this if you are looking for diverse, categorized public datasets to kickstart a research project, perform data analysis, or train models.
Not ideal if you need a specialized dataset not covered by common categories, or if you require an integrated API for direct data access rather than a list of links.
Stars
24
Forks
7
Language
—
License
—
Category
Last pushed
May 09, 2017
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/asampat3090/open-datasets"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
open-edge-platform/datumaro
Dataset Management Framework, a Python library and a CLI tool to build, analyze and manage...
explosion/ml-datasets
🌊 Machine learning dataset loaders for testing and example scripts
webdataset/webdataset
A high-performance Python-based I/O system for large (and small) deep learning problems, with...
tensorflow/datasets
TFDS is a collection of datasets ready to use with TensorFlow, Jax, ...
mlcommons/croissant
Croissant is a high-level format for machine learning datasets that brings together four rich layers.