aarmea/readability-scrape
Retrieve simplified versions of webpages, powered by Mozilla's Readability.js
This tool helps anyone who needs to quickly get the main content from a webpage without distractions like ads, menus, or sidebars. You provide a web address, and it gives you a clean, simplified version of the article as plain text or HTML. Journalists, researchers, content curators, or anyone who frequently extracts information from web articles will find this useful.
No commits in the last 6 months. Available on npm.
Use this if you need to extract the core text or HTML content from many web articles, stripped of all non-essential elements.
Not ideal if you need to preserve the original webpage's full layout, styling, or interactive features.
Stars
15
Forks
—
Language
JavaScript
License
—
Category
Last pushed
Oct 14, 2018
Commits (30d)
0
Dependencies
3
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/perception/aarmea/readability-scrape"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
scrapy/scrapy
Scrapy, a fast high-level web crawling & scraping framework for Python.
Altimis/Scweet
A simple and unlimited twitter scraper : scrape tweets, likes, retweets, following, followers,...
lexiforest/curl_cffi
Python binding for curl-impersonate fork via cffi. A http client that can impersonate browser...
plabayo/rama
modular service framework to move and transform network packets
scrapinghub/spidermon
Scrapy Extension for monitoring spiders execution.