GoncaloMark/CobWeb-lnx

CobWeb is a Python library for web scraping. The library consists of two classes: Spider and Scraper.

29
/ 100
Experimental

CobWeb helps developers extract specific information from websites by first exploring a site's links and then pulling out content like text, images, or specific data points. You input a starting website URL and define what you're looking for (e.g., all headings, specific paragraphs, or links), and it outputs the collected data in a structured format. This tool is for Python developers who need to programmatically gather data from many web pages for analysis or other applications.

No commits in the last 6 months.

Use this if you are a Python developer and need to automate the collection of specific data from multiple web pages or an entire website.

Not ideal if you are not a programmer or need a visual, point-and-click tool for web scraping.

data-acquisition web-data-extraction developer-tooling content-gathering
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 7 / 25
Maturity 16 / 25
Community 6 / 25

How are scores calculated?

Stars

39

Forks

2

Language

Python

License

MIT

Category

scraper

Last pushed

Nov 28, 2023

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/perception/GoncaloMark/CobWeb-lnx"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.