armand1m/papercut

Papercut is a scraping/crawling library for Node.js built on top of JSDOM. It provides basic selector features together with features like Page Caching and Geosearch.

38
/ 100
Emerging

This tool helps web developers gather specific data from websites, especially those with dynamic content or multiple pages. You provide a website's URL and define what pieces of information you want (like product names, prices, or addresses), and it returns that data in a structured format. It's designed for developers building applications that need to collect information programmatically from the web.

No commits in the last 6 months. Available on npm.

Use this if you are a web developer who needs to programmatically extract structured data from single pages, paginated lists, or even modern single-page applications.

Not ideal if you need a simple point-and-click tool for occasional data extraction without writing code.

web-scraping data-extraction web-crawling data-collection api-development
Stale 6m
Maintenance 0 / 25
Adoption 7 / 25
Maturity 25 / 25
Community 6 / 25

How are scores calculated?

Stars

39

Forks

2

Language

TypeScript

License

MIT

Category

scraper

Last pushed

Jan 08, 2023

Commits (30d)

0

Dependencies

5

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/perception/armand1m/papercut"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.