masesk/process-google-dataset
Process Google Dataset is a tool for downloading and processing images from a Google Image Search for use with neural networks, using a Chrome extension and a simple Python script.
This tool helps machine learning engineers and researchers quickly gather image datasets for training computer vision models. You use a Chrome extension to browse Google Images, which outputs a JSON file of image links. A Python script then uses this file to download the images and organize them into labeled folders.
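The download-and-organize step described above can be sketched roughly as follows. This is not the project's actual script: the JSON layout (a mapping from label to a list of image URLs), the function names `fetch_bytes` and `download_dataset`, and the `.jpg` fallback extension are all assumptions made for illustration.

```python
import urllib.request
from pathlib import Path
from urllib.parse import urlparse


def fetch_bytes(url, timeout=10):
    """Download one URL and return its raw bytes."""
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return resp.read()


def download_dataset(links_by_label, out_dir, fetch=fetch_bytes):
    """Save each image under out_dir/<label>/ and return the written paths.

    links_by_label uses a hypothetical layout:
    {"cat": ["https://...", ...], "dog": [...]}
    """
    saved = []
    for label, urls in links_by_label.items():
        label_dir = Path(out_dir) / label
        label_dir.mkdir(parents=True, exist_ok=True)
        for index, url in enumerate(urls):
            # Keep the URL's file extension when it has one, else assume .jpg.
            suffix = Path(urlparse(url).path).suffix or ".jpg"
            target = label_dir / f"{index:05d}{suffix}"
            try:
                target.write_bytes(fetch(url))
            except OSError as err:  # URLError and timeouts are OSError subclasses
                print(f"skipped {url}: {err}")
                continue
            saved.append(target)
    return saved
```

To use it, load the extension's JSON output with `json.load` and pass the resulting dictionary to `download_dataset(links, "dataset")`. Dead links are common in scraped image lists, so the sketch skips failed downloads and keeps going; the `fetch` parameter exists only so the downloader can be swapped out or stubbed in tests.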
No commits in the last 6 months.
Use this if you need to quickly build a custom image dataset from Google Images for your machine learning projects, especially for prototyping or specific niche categories.
Not ideal if you need perfectly clean, pre-labeled, and high-resolution datasets for production-grade models, or if you prefer not to use a browser extension for data collection.
Stars: 33
Forks: 6
Language: JavaScript
License: GPL-3.0
Category
Last pushed: Sep 08, 2021
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/masesk/process-google-dataset"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
Higher-rated alternatives
rom1504/img2dataset
Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M...
devrimcavusoglu/pybboxes
Light weight toolkit for bounding boxes providing conversion between bounding box types and...
PyRetri/PyRetri
Open source deep learning based unsupervised image retrieval toolbox built on PyTorch🔥
Particle1904/DatasetHelpers
Dataset Helper program to automatically select, re scale and tag Datasets (composed of image and...
salesforce/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence