justmarkham/trump-lies
Tutorial: Web scraping in Python with Beautiful Soup
This project helps you take information published on a website, like a news article, and turn it into a structured dataset. It shows you how to programmatically extract specific pieces of text (like dates, quotes, and links) from a web page and save them into a file. Anyone who needs to collect and organize data from public websites for analysis or record-keeping would find this useful.
247 stars. No commits in the last 6 months.
Use this if you need to extract specific information from a static web page and store it in a clean, organized format.
Not ideal if you're dealing with complex websites that require login, load content dynamically with JavaScript, or use strong anti-scraping measures.
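The workflow described above — fetch a static page, locate repeated elements, and extract dates, quotes, and links into a structured dataset — can be sketched with Beautiful Soup. This is a minimal illustration, not the tutorial's actual code: the HTML snippet and the class names (`short-desc`) are hypothetical stand-ins for whatever structure the target page uses.

```python
from bs4 import BeautifulSoup

# Hypothetical page fragment; a real scraper would fetch this with
# requests.get(url).text instead of a hard-coded string.
html = """
<span class="short-desc">
  <strong>Jan. 21</strong> "I wasn't a fan of Iraq."
  <a href="https://example.com/fact-check">(He was.)</a>
</span>
"""

soup = BeautifulSoup(html, "html.parser")

records = []
for span in soup.find_all("span", class_="short-desc"):
    date = span.find("strong").get_text(strip=True)  # e.g. "Jan. 21"
    link = span.find("a")["href"]                    # the source URL
    records.append((date, link))

print(records)  # [('Jan. 21', 'https://example.com/fact-check')]
```

From here, a list of tuples like `records` can be loaded into a pandas DataFrame and saved to CSV, which is the "structured dataset" half of the workflow.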
Stars
247
Forks
217
Language
Jupyter Notebook
License
—
Category
—
Last pushed
Nov 18, 2018
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/perception/justmarkham/trump-lies"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
scrapy/scrapy
Scrapy, a fast high-level web crawling & scraping framework for Python.
Altimis/Scweet
A simple and unlimited twitter scraper : scrape tweets, likes, retweets, following, followers,...
lexiforest/curl_cffi
Python binding for curl-impersonate fork via cffi. A http client that can impersonate browser...
plabayo/rama
modular service framework to move and transform network packets
scrapinghub/spidermon
Scrapy Extension for monitoring spiders execution.