aarmea/readability-scrape

Retrieve simplified versions of webpages, powered by Mozilla's Readability.js

31
/ 100
Emerging

This tool helps anyone who needs to quickly get the main content from a webpage without distractions like ads, menus, or sidebars. You provide a web address, and it gives you a clean, simplified version of the article as plain text or HTML. Journalists, researchers, content curators, or anyone who frequently extracts information from web articles will find this useful.

No commits in the last 6 months. Available on npm.

Use this if you need to extract the core text or HTML content from many web articles, stripped of all non-essential elements.

Not ideal if you need to preserve the original webpage's full layout, styling, or interactive features.

content-extraction research-assist web-scraping information-retrieval text-analysis-prep
Stale 6m
Maintenance 0 / 25
Adoption 6 / 25
Maturity 25 / 25
Community 0 / 25

How are scores calculated?

Stars

15

Forks

Language

JavaScript

License

Category

scraper

Last pushed

Oct 14, 2018

Commits (30d)

0

Dependencies

3

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/perception/aarmea/readability-scrape"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.