noahgift/web_scraping_python

Techniques for Scraping the Web in Python

Score: 34 / 100 (Emerging)

This project helps data professionals gather information from websites automatically and at scale. It takes website URLs or specific web pages as input and extracts structured data, which can then be used for analysis or fed into AI/ML models. This is ideal for data scientists, analysts, or AI engineers who need to collect large datasets from the web.

No commits in the last 6 months.

Use this if you need to programmatically collect data from multiple websites to build datasets for machine learning, market research, or content aggregation.

Not ideal if you need a simple, one-off data extraction from a single page or prefer a no-code solution.

Tags: data-collection, market-research, competitive-intelligence, content-aggregation, dataset-building
Status: No License, Stale 6m, No Package, No Dependents
Maintenance 0 / 25
Adoption 7 / 25
Maturity 8 / 25
Community 19 / 25


Stars: 27
Forks: 22
Language: Jupyter Notebook
License: none
Category: scraper
Last pushed: May 31, 2018
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/perception/noahgift/web_scraping_python"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
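
The same request can be made from Python. The following is a minimal sketch using only the standard library. It assumes the endpoint returns a JSON body, which this page does not confirm, and the names `quality_url` and `fetch_quality` are hypothetical helpers introduced for illustration, not part of any published client.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path copied from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality/perception"


def quality_url(owner: str, repo: str) -> str:
    """Build the endpoint URL for an owner/repo pair (hypothetical helper)."""
    # Percent-encode each path segment so unusual characters stay inside it.
    return f"{BASE_URL}/{quote(owner, safe='')}/{quote(repo, safe='')}"


def fetch_quality(owner: str, repo: str, timeout: float = 10.0):
    """Fetch the quality data for a repository.

    Assumption: the response body is JSON. If it is not, json.loads
    raises a ValueError (json.JSONDecodeError).
    """
    with urllib.request.urlopen(quality_url(owner, repo), timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


if __name__ == "__main__":
    # Performs a real network request, which counts against the daily limit.
    data = fetch_quality("noahgift", "web_scraping_python")
    print(json.dumps(data, indent=2))
```

Because unauthenticated access is limited to 100 requests/day, it may be worth caching the result locally rather than calling `fetch_quality` repeatedly.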