gustavovalverde/h2m-parser
Fast HTML to Markdown converter with Mozilla Readability extraction, streaming renderer, and LLM-ready output. 4x times faster than famous alternatives
This tool helps content managers, researchers, and data analysts quickly extract the main article content from any webpage and convert it into clean, structured Markdown. It takes raw HTML from a web page and outputs a refined Markdown document, optionally including important metadata and content suitable for further analysis or integration with AI tools. The typical user needs to process web articles for various applications, like building knowledge bases or training language models.
No commits in the last 6 months. Available on npm.
Use this if you need a fast and reliable way to convert website articles into clean Markdown, especially for large volumes or for use with AI systems.
Not ideal if you only need to convert simple HTML snippets without the need for article extraction or advanced post-processing.
Stars
9
Forks
1
Language
TypeScript
License
MIT
Category
Last pushed
Oct 06, 2025
Commits (30d)
0
Dependencies
5
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/gustavovalverde/h2m-parser"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
microsoft/markitdown
Python tool for converting files and office documents to Markdown.
doocs/md
✍ WeChat Markdown Editor | 一款高度简洁的微信 Markdown 编辑器:支持 Markdown 语法、自定义主题样式、内容管理、多图床、AI 助手等特性
AIDotNet/OpenDeepWiki
OpenDeepWiki is the open-source version of the DeepWiki project, aiming to provide a powerful...
hyperfield/ai-file-sorter
Cross-platform desktop application for content-aware file organization and renaming. Supports...
drl990114/MarkFlowy
The AI Markdown Editor