JovianHQ/opendatasets
A Python library for downloading datasets from Kaggle, Google Drive, and other online sources.
This tool helps data analysts and data scientists quickly get the data they need for their projects. You provide a link to a dataset on a platform such as Kaggle or Google Drive, and it downloads the files directly to your computer. It's designed for anyone who regularly works with publicly available datasets for analysis or model training.
Use this if you need to easily download datasets from popular online sources like Kaggle or Google Drive for your data analysis or machine learning workflows.
Not ideal if you need to scrape data from websites or work with proprietary datasets that require complex authentication.
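The link-based download described above might look like the following minimal sketch. It assumes the package is installed (`pip install opendatasets`) and relies on the library's `download` function; `detect_source` and `fetch_dataset` are hypothetical helpers introduced only for illustration, and the credential behaviour noted in the comments is an assumption to verify against the project's README.

```python
from urllib.parse import urlparse


def detect_source(url: str) -> str:
    """Classify a dataset URL by host (hypothetical helper, not part of opendatasets)."""
    host = urlparse(url).netloc.lower()
    if host == "kaggle.com" or host.endswith(".kaggle.com"):
        return "kaggle"
    if host == "drive.google.com":
        return "google-drive"
    return "other"


def fetch_dataset(url: str, data_dir: str = "./data") -> str:
    """Download the dataset at `url` into `data_dir`; return the detected source."""
    # Imported lazily so the helper above works without the package installed.
    import opendatasets as od

    source = detect_source(url)
    # Assumption: for Kaggle links the library prompts for a Kaggle username
    # and API key unless it finds credentials in a local kaggle.json file.
    od.download(url, data_dir=data_dir)
    return source
```

Calling `fetch_dataset` with a Kaggle dataset URL you have access to should place the files under `./data`. The host check is kept separate from the download so it can be exercised without network access or credentials.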
Stars: 347
Forks: 142
Language: Python
License: MIT
Category:
Last pushed: Jan 10, 2026
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/JovianHQ/opendatasets"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
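The same request can be sketched in Python using only the standard library. This assumes the endpoint returns a JSON body, which the page does not state; `build_quality_url` and `fetch_quality` are hypothetical names, and the meaning of the `ml-frameworks` path segment is inferred from the URL above rather than documented here. How an API key would be passed is not shown on the page, so the sketch covers only the keyless request.

```python
import json
from urllib.request import urlopen

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(group: str, owner: str, repo: str) -> str:
    """Build the endpoint URL for one repository (hypothetical helper)."""
    return f"{BASE_URL}/{group}/{owner}/{repo}"


def fetch_quality(group: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the record for one repository and parse the body as JSON.

    Assumption: the response is JSON; the schema is not documented here,
    so the parsed value is returned without further interpretation.
    """
    with urlopen(build_quality_url(group, owner, repo), timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

For example, `fetch_quality("ml-frameworks", "JovianHQ", "opendatasets")` issues the same GET request as the curl command. With the keyless limit of 100 requests/day, caching the result locally is worth considering.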
Related frameworks
open-edge-platform/datumaro
Dataset Management Framework, a Python library and a CLI tool to build, analyze and manage...
explosion/ml-datasets
🌊 Machine learning dataset loaders for testing and example scripts
webdataset/webdataset
A high-performance Python-based I/O system for large (and small) deep learning problems, with...
tensorflow/datasets
TFDS is a collection of datasets ready to use with TensorFlow, Jax, ...
mlcommons/croissant
Croissant is a high-level format for machine learning datasets that brings together four rich layers.