hannah2gah/web-scraping-tokopedia
Repository for Tokopedia web scraping with Selenium and Beautiful Soup.
This project helps e-commerce analysts or market researchers gather product and seller information from Tokopedia. By providing a Tokopedia URL, it collects data like product names, prices, ratings, seller details, and sales estimates, outputting it into a CSV file. It's designed for anyone needing to quickly extract structured data for competitive analysis or market trends.
No commits in the last 6 months.
Use this if you need to systematically collect product, seller, and sales data from specific pages on Tokopedia for market research or competitive analysis.
Not ideal if you need real-time data or require scraping from e-commerce platforms other than Tokopedia.
Stars
13
Forks
2
Language
Python
License
—
Category
Last pushed
May 24, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/perception/hannah2gah/web-scraping-tokopedia"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
seleniumbase/SeleniumBase
APIs for browser automation, testing, and bypassing bot-detection.
apify/crawlee-python
Crawlee—A web scraping and browser automation library for Python to build reliable crawlers....
intoli/user-agents
A JavaScript library for generating random user agents with data that's updated daily.
apify/crawlee
Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In...
Kaliiiiiiiiii-Vinyzu/patchright
Undetected version of the Playwright testing and automation library.