jurismarches/chopper

Chopper is a tool to extract elements from HTML while preserving their ancestors and CSS rules.

Score: 51 / 100 (Established)

This tool helps web content creators and developers extract specific sections from HTML documents while preserving their styling and structural context. You provide an HTML page and its associated CSS and specify which parts to keep and which to discard; it outputs a smaller HTML snippet containing only the chosen content, along with the CSS rules that apply to it. This suits anyone who needs to repurpose or display a fragment of a webpage without losing its intended appearance.
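The "keep an element together with its ancestors" idea described above can be sketched with Python's standard library alone. This is only an illustration of the concept, not Chopper's implementation or API: the function name `keep_with_ancestors`, the sample page, and the choice of `xml.etree.ElementTree` are all assumptions made for this example, the input must be well-formed XHTML, and filtering of CSS rules is left out entirely.

```python
import xml.etree.ElementTree as ET


def keep_with_ancestors(root, keep_path, discard_path=None):
    """Prune `root` in place (hypothetical helper, not part of Chopper).

    Elements matched by `keep_path` survive together with their
    descendants and their chain of ancestors; everything else is removed.
    """
    # ElementTree has no parent pointers, so build a child -> parent map.
    parent_of = {child: parent for parent in root.iter() for child in parent}

    # Drop explicitly unwanted elements first. Note that ElementTree stores
    # trailing text on the element itself (its tail), so that text goes too.
    if discard_path:
        for el in root.findall(discard_path):
            parent_of[el].remove(el)

    kept = set()
    for el in root.findall(keep_path):
        kept.update(el.iter())       # the match and all of its descendants
        while el in parent_of:       # walk up to the root
            el = parent_of[el]
            kept.add(el)

    # Remove every element that is neither kept nor the root itself.
    for el in list(root.iter()):
        if el is not root and el not in kept:
            parent_of[el].remove(el)
    return root


# Sample input, invented for this example.
PAGE = """<html><body>
<div id="main"><p class="keep">Hello <a class="ad" href="#">ad</a></p><p>Other</p></div>
<div id="footer">Footer</div>
</body></html>"""

root = ET.fromstring(PAGE)
keep_with_ancestors(root, ".//p[@class='keep']", ".//a[@class='ad']")
result = ET.tostring(root, encoding="unicode")
```

After the call, `result` still has the `html`, `body`, and `div#main` wrappers around the kept paragraph, while the footer, the sibling paragraph, and the discarded link are gone.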

Available on PyPI.

Use this if you need to isolate a specific part of an HTML document, like a product description or an article paragraph, ensuring its visual style and parent structure remain intact without including unnecessary surrounding elements or CSS.

Not ideal if you're looking for a simple text extractor or a tool that completely restyles content, as its primary function is to preserve original styling and HTML structure for selected elements.

Tags: web-content-extraction, html-parsing, css-preservation, web-scraping, content-curation
Maintenance 13 / 25
Adoption 6 / 25
Maturity 25 / 25
Community 7 / 25

Stars: 23
Forks: 2
Language: Python
License: MIT
Category: scraper
Last pushed: Mar 30, 2026
Commits (30d): 0
Dependencies: 3

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/perception/jurismarches/chopper"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.