omar-elmaria/python_scrapy_airflow_pipeline

This repo contains a full-fledged Python script that scrapes a JavaScript-rendered website, cleans the data, and pushes the results to a cloud-based database. The workflow is orchestrated on Airflow to run automatically.


Experimental

This project helps e-commerce analysts or competitive intelligence specialists automatically gather detailed product and pricing data from competitor websites, even those with anti-bot measures. It takes a website URL as input and outputs a structured table containing product names, categories, prices, discounts, delivery times, and other key details directly into a cloud database. This enables users to track competitor strategies and market trends without manual effort.
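The scrape, clean, and load flow described above can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the repository's actual code: the `Product` fields, the raw record keys, the price and discount formats, and the `products` table are all hypothetical, the hard-coded `raw_records` list stands in for scraper output, and an in-memory SQLite database stands in for the cloud database. An orchestrator such as Airflow would call these steps in order on a schedule.

```python
import sqlite3
from dataclasses import dataclass, astuple
from typing import Optional


@dataclass
class Product:
    # Hypothetical schema; the real pipeline's columns may differ.
    name: str
    category: str
    price: float
    discount_pct: Optional[float]
    delivery_minutes: Optional[int]


def clean(raw: dict) -> Product:
    """Normalize one raw scraped record (field names and formats are assumed)."""
    # Assumed price format: "12,99 €" (decimal comma, trailing currency sign).
    price = float(raw["price"].replace("€", "").replace(",", ".").strip())
    # Assumed discount format: "-15%"; a missing value means no discount.
    discount = raw.get("discount")
    discount_pct = float(discount.strip("-% ")) if discount else None
    # Assumed delivery format: "30 min"; a missing value means unknown.
    delivery = raw.get("delivery")
    delivery_minutes = int(delivery.split()[0]) if delivery else None
    return Product(
        raw["name"].strip(),
        raw["category"].strip(),
        price,
        discount_pct,
        delivery_minutes,
    )


def load(products: list, conn: sqlite3.Connection) -> None:
    """Write cleaned rows to a table (SQLite stands in for the cloud database)."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS products ("
        "name TEXT, category TEXT, price REAL, "
        "discount_pct REAL, delivery_minutes INTEGER)"
    )
    conn.executemany(
        "INSERT INTO products VALUES (?, ?, ?, ?, ?)",
        [astuple(p) for p in products],
    )
    conn.commit()


# Stand-in for scraper output; a real run would get these from the spider.
raw_records = [
    {"name": " Espresso Beans 1kg ", "category": "Coffee",
     "price": "12,99 €", "discount": "-15%", "delivery": "30 min"},
    {"name": "Oat Milk", "category": "Drinks",
     "price": "2,49 €", "discount": None, "delivery": None},
]

conn = sqlite3.connect(":memory:")
products = [clean(r) for r in raw_records]
load(products, conn)
rows = conn.execute(
    "SELECT name, price, discount_pct FROM products ORDER BY name"
).fetchall()
```

Keeping the cleaning step as a pure function over one record makes it easy to test without a browser or a database, which is useful when the scraping step itself depends on a JavaScript-rendered page.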

No commits in the last 6 months.

Use this if you need to regularly collect comprehensive product and pricing information from JavaScript-rendered e-commerce websites for competitive analysis or market research.

Not ideal if you only need to scrape a website once manually, or if you require a simple, code-free scraping solution for basic data extraction.

e-commerce-analytics competitor-intelligence market-research pricing-strategy product-assortment

No License, Stale 6m, No Package, No Dependents

Maintenance 0 / 25

Adoption 5 / 25

Maturity 8 / 25

Community 0 / 25


Stars

Forks

—

Language

Python

License

—

Featured in

Giving AI Agents Eyes: Browser Automation in 2026

Higher-rated alternatives

scrapy/scrapy

Scrapy, a fast high-level web crawling & scraping framework for Python.

Altimis/Scweet

A simple and unlimited Twitter scraper: scrape tweets, likes, retweets, following, followers,...

lexiforest/curl_cffi

Python binding for curl-impersonate fork via cffi. An HTTP client that can impersonate browser...

plabayo/rama

modular service framework to move and transform network packets

scrapinghub/spidermon

Scrapy Extension for monitoring spiders execution.
