kreuzberg-dev/html-to-markdown

High-performance, CommonMark-compliant HTML-to-Markdown converter, maintained by the Kreuzberg team. Kreuzberg is a fast, polyglot document intelligence engine with a Rust core that extracts structured data from 56+ document formats using streaming parsers and built-in OCR.

Score: 64 / 100 (Established)

This tool helps developers transform web page content or other HTML snippets into clean, readable Markdown format. You provide it with raw HTML, and it outputs well-structured Markdown, along with extracted metadata like titles, links, and tables. It's designed for developers building applications that process or display web content, ensuring consistent conversion across various programming languages.
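
To make the input and output shape concrete, here is a toy sketch of the idea: HTML goes in, Markdown plus a list of extracted links comes out. It uses only Python's standard library and is not the project's API. The names `TinyMarkdownConverter` and `toy_convert` are hypothetical, and only a handful of tags are handled.

```python
from html.parser import HTMLParser


class TinyMarkdownConverter(HTMLParser):
    """Toy converter (hypothetical, not the library): h1-h3, p, a, strong, em."""

    def __init__(self):
        super().__init__()
        self.parts = []    # Markdown fragments in document order
        self.links = []    # extracted metadata: every href seen
        self._href = None  # href of the <a> currently open, if any

    def handle_starttag(self, tag, attrs):
        if tag in ("h1", "h2", "h3"):
            # Heading level becomes the number of leading '#' characters.
            self.parts.append("#" * int(tag[1]) + " ")
        elif tag in ("strong", "b"):
            self.parts.append("**")
        elif tag in ("em", "i"):
            self.parts.append("*")
        elif tag == "a":
            self._href = dict(attrs).get("href") or ""
            self.links.append(self._href)
            self.parts.append("[")

    def handle_endtag(self, tag):
        if tag in ("h1", "h2", "h3", "p"):
            # Block elements end with a blank line.
            self.parts.append("\n\n")
        elif tag in ("strong", "b"):
            self.parts.append("**")
        elif tag in ("em", "i"):
            self.parts.append("*")
        elif tag == "a":
            self.parts.append(f"]({self._href})")
            self._href = None

    def handle_data(self, data):
        self.parts.append(data)


def toy_convert(html):
    """Return (markdown, links) for a small HTML snippet."""
    parser = TinyMarkdownConverter()
    parser.feed(html)
    parser.close()
    return "".join(parser.parts).strip(), parser.links
```

A production converter also has to deal with nesting, escaping, lists, tables, and malformed markup, which is where a dedicated library earns its keep.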

565 stars. Actively maintained with 158 commits in the last 30 days.

Use this if you need to reliably convert HTML content into Markdown for storage, display, or further processing within a software application, especially across different programming languages.

Not ideal if you are an end-user needing a simple drag-and-drop tool for occasional personal HTML to Markdown conversion without programming.

web-content-processing data-transformation document-conversion content-management developer-tooling
No Package, No Dependents
Maintenance 22 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 16 / 25

Stars: 565
Forks: 50
Language: HTML
License: MIT
Last pushed: Mar 13, 2026
Commits (30d): 158

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/rag/kreuzberg-dev/html-to-markdown"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
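
The same request can be made from a script. The sketch below uses only Python's standard library and rests on two assumptions not confirmed by this page: that the endpoint returns JSON, and that other repositories follow the same `owner/repo` path pattern as the URL above. The helper names `quality_url` and `fetch_quality` are hypothetical.

```python
import json
import urllib.request

# Base path copied from the curl example above.
API_BASE = "https://pt-edge.onrender.com/api/v1/quality/rag"


def quality_url(owner, repo):
    """Build the endpoint URL (assumes the owner/repo pattern generalizes)."""
    return f"{API_BASE}/{owner}/{repo}"


def fetch_quality(owner, repo, timeout=10.0):
    """Fetch and decode the response (assumes the body is JSON)."""
    with urllib.request.urlopen(quality_url(owner, repo), timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

Calling `fetch_quality("kreuzberg-dev", "html-to-markdown")` issues the same GET request as the curl command. With the keyless limit of 100 requests/day, caching the result locally is a sensible precaution for repeated runs.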