KryeKuzhinieri/Scrape-GettyImages-Using-Selenium
Download images from gettyimages.com using Python and Selenium.
This project helps researchers and data scientists quickly gather image datasets for machine learning tasks. You provide a Getty Images search URL and the desired number of pages, and it downloads lower-resolution preview images to your specified directory. It's designed for anyone who needs to build a custom image dataset for classification or other AI training purposes without requiring high-resolution assets or a Getty Images account.
No commits in the last 6 months.
Use this if you need a collection of images from Getty Images for tasks like training an image classification model and don't require full-resolution photos.
Not ideal if you need high-resolution images, have a commercial use case requiring proper licensing, or are looking to download images from a different stock photo website.
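The workflow described above (a search URL plus a page count) can be sketched roughly as follows. This is a minimal illustration, not the project's actual code: the function names `page_urls` and `collect_image_sources`, the `page` query parameter, the example search URL in the usage note, and the idea of reading every `<img>` element's `src` are all assumptions made for the example.

```python
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit


def page_urls(search_url, pages, page_param="page"):
    """Return one URL per results page, numbered 1..pages.

    ASSUMPTION: the site carries the page number in a query
    parameter named `page`; adjust `page_param` if it does not.
    """
    parts = urlsplit(search_url)
    # Drop any page parameter already present so it is not duplicated.
    query = [
        (key, value)
        for key, value in parse_qsl(parts.query, keep_blank_values=True)
        if key != page_param
    ]
    urls = []
    for number in range(1, pages + 1):
        new_query = urlencode(query + [(page_param, str(number))])
        urls.append(urlunsplit(parts._replace(query=new_query)))
    return urls


def collect_image_sources(url):
    """Open `url` in Chrome and return the `src` of every <img> element.

    ASSUMPTION: preview images appear as plain <img> tags. The real
    page structure may differ. Requires the `selenium` package and a
    working Chrome installation.
    """
    # Imported here so the URL helper above works without Selenium installed.
    from selenium import webdriver
    from selenium.webdriver.common.by import By

    driver = webdriver.Chrome()
    try:
        driver.get(url)
        images = driver.find_elements(By.TAG_NAME, "img")
        sources = (image.get_attribute("src") for image in images)
        return [source for source in sources if source]
    finally:
        # Always close the browser, even if the page fails to load.
        driver.quit()
```

For a hypothetical search URL such as `https://www.gettyimages.com/photos/cat?phrase=cat`, `page_urls(url, 3)` yields three URLs that differ only in the `page` value, and each could then be passed to `collect_image_sources`.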
Stars: 17
Forks: 5
Language: Python
License: MIT
Category:
Last pushed: Feb 15, 2024
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/perception/KryeKuzhinieri/Scrape-GettyImages-Using-Selenium"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
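The same request can be made from Python with only the standard library. This is a sketch under assumptions: the helper names `perception_url` and `fetch_perception` are invented for illustration, and the response is assumed to be JSON, which the page does not state.

```python
import json
from urllib.parse import quote
from urllib.request import urlopen

# Base path taken from the curl command above.
API_BASE = "https://pt-edge.onrender.com/api/v1/quality/perception"


def perception_url(owner, repo):
    """Build the endpoint URL for a repository given as owner and name."""
    return f"{API_BASE}/{quote(owner, safe='')}/{quote(repo, safe='')}"


def fetch_perception(owner, repo, timeout=10):
    """Fetch the record for one repository.

    ASSUMPTION: the endpoint returns a JSON body; the page does not
    document the response format.
    """
    with urlopen(perception_url(owner, repo), timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_perception("KryeKuzhinieri", "Scrape-GettyImages-Using-Selenium")` issues the same GET request as the curl command and counts against the daily request limit.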
Higher-rated alternatives
seleniumbase/SeleniumBase
APIs for browser automation, testing, and bypassing bot-detection.
apify/crawlee-python
Crawlee—A web scraping and browser automation library for Python to build reliable crawlers....
intoli/user-agents
A JavaScript library for generating random user agents with data that's updated daily.
apify/crawlee
Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In...
Kaliiiiiiiiii-Vinyzu/patchright
Undetected version of the Playwright testing and automation library.