scrapinghub/web-poet

Web scraping Page Objects core library

66
/ 100
Established

This library helps web scraping developers organize their code for extracting specific data from websites. It takes raw HTML content from a web page and, through structured code, outputs the desired data points like product names, prices, or article text. It's designed for Python developers who build and maintain web scrapers.

105 stars. Used by 1 other package. Available on PyPI.

Use this if you are a web scraping developer looking to make your parsing logic more maintainable, reusable, and testable across different web pages.

Not ideal if you are looking for a complete web scraping framework or a tool that handles fetching web pages, as this focuses specifically on the data extraction part.

web-scraping data-extraction parsing-logic developer-tool code-maintainability
Maintenance 13 / 25
Adoption 10 / 25
Maturity 25 / 25
Community 18 / 25

How are scores calculated?

Stars

105

Forks

18

Language

Python

License

BSD-3-Clause

Category

scraper

Last pushed

Apr 02, 2026

Commits (30d)

0

Dependencies

11

Reverse dependents

1

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/perception/scrapinghub/web-poet"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.