philschmid/clipper.js
HTML to Markdown converter and crawler.
This tool helps you quickly save content from web pages or local HTML files by converting them into clean Markdown text. You input a web address or an HTML file, and it outputs the main article content as a Markdown file. It's designed for anyone who wants to archive web content or take notes without relying on browser extensions or cloud services.
614 stars. No commits in the last 6 months.
Use this if you need a straightforward way to extract and save web articles or local HTML content as Markdown for offline reading, note-taking, or building personal knowledge bases.
Not ideal if you need a visual clipping experience, advanced annotation features, or integration with specific note-taking apps that require a graphical user interface.
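To make the HTML-to-Markdown step described above concrete, here is a minimal sketch of the idea in TypeScript. It is a toy illustration, not clipper.js's actual implementation: the function name `htmlToMarkdown` is hypothetical, it handles only a handful of tags, and it uses regular expressions where a real converter would parse the DOM and first isolate the main article content.

```typescript
// Toy HTML-to-Markdown converter (hypothetical helper, not part of clipper.js).
// Handles only headings h1-h3, links, bold, italics, and paragraphs.
function htmlToMarkdown(html: string): string {
  return html
    // <h1>..</h1> through <h3>..</h3> become "# ", "## ", "### "
    .replace(
      /<h([1-3])[^>]*>([\s\S]*?)<\/h\1>/gi,
      (_match: string, level: string, text: string) =>
        "###".slice(0, Number(level)) + " " + text.trim() + "\n\n"
    )
    // <a href="url">text</a> becomes [text](url)
    .replace(/<a\s[^>]*href="([^"]*)"[^>]*>([\s\S]*?)<\/a>/gi, "[$2]($1)")
    // <strong>/<b> become **bold**
    .replace(/<(strong|b)>([\s\S]*?)<\/\1>/gi, "**$2**")
    // <em>/<i> become *italic*
    .replace(/<(em|i)>([\s\S]*?)<\/\1>/gi, "*$2*")
    // Paragraphs become text followed by a blank line
    .replace(/<p(?:\s[^>]*)?>([\s\S]*?)<\/p>/gi, "$1\n\n")
    // Drop any tags this sketch does not understand
    .replace(/<[^>]+>/g, "")
    .trim();
}

const sample =
  '<h1>Title</h1><p>Hello <strong>world</strong>, see <a href="https://example.com">this</a>.</p>';
console.log(htmlToMarkdown(sample));
```

Regex-based conversion breaks on nested or malformed markup, which is why production tools rely on a real HTML parser; the sketch only shows the shape of the input-to-output mapping.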
Stars: 614
Forks: 39
Language: TypeScript
License: Apache-2.0
Category:
Last pushed: Jan 09, 2024
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/rag/philschmid/clipper.js"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
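The same request can be made from TypeScript. The sketch below assumes a runtime with a global `fetch` (for example Node.js 18 or later) and assumes the endpoint returns JSON; the response shape is not documented here, so it is typed as `unknown`. The names `qualityUrl` and `fetchQuality` are hypothetical helpers, not part of any published client.

```typescript
// Base path taken from the curl example above.
const API_BASE = "https://pt-edge.onrender.com/api/v1/quality/rag";

// Build the endpoint URL for a given repository (hypothetical helper).
function qualityUrl(owner: string, repo: string): string {
  return `${API_BASE}/${encodeURIComponent(owner)}/${encodeURIComponent(repo)}`;
}

// Fetch the quality data. Assumes a JSON response body; the shape is unknown.
async function fetchQuality(owner: string, repo: string): Promise<unknown> {
  const res = await fetch(qualityUrl(owner, repo));
  if (!res.ok) {
    // Non-2xx responses (including rate limiting) surface as errors.
    throw new Error(`Request failed with status ${res.status}`);
  }
  return res.json();
}

// Usage (performs a network request, so it is left commented out):
// fetchQuality("philschmid", "clipper.js").then((data) => console.log(data));
```

Without a key, requests count against the 100 requests/day limit, so cache results rather than polling.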
Higher-rated alternatives
any4ai/AnyCrawl
AnyCrawl: A Node.js/TypeScript crawler that turns websites into LLM-ready data and extracts...
kreuzberg-dev/html-to-markdown
High performance and CommonMark compliant HTML to Markdown converter. Maintained by the...
ScrapeGraphAI/Scrapegraph-ai
Python scraper based on AI
adbar/trafilatura
Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping,...
paulpierre/markdown-crawler
A multithreaded web crawler that recursively crawls a website and creates a markdown file...