Nano-Collective/get-md

A fast, lightweight HTML-to-Markdown converter optimized for LLM consumption. It uses established parsing libraries to produce clean, well-structured Markdown, with content extraction and noise filtering.

Quality score: 34 / 100 (Emerging)

This tool helps developers and AI engineers quickly transform messy HTML content, like web pages or article bodies, into clean, structured Markdown. It takes either raw HTML text or a URL and outputs well-formatted Markdown that is optimized for consumption by Large Language Models (LLMs). Anyone building or working with LLM applications that need to process web content will find this useful.

Use this if you need to feed web content into an LLM and require a fast, reliable way to convert HTML to high-quality Markdown.

Not ideal if your primary goal is general-purpose HTML parsing for display in a browser or detailed DOM manipulation, rather than LLM input.
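
To make the idea of HTML-to-Markdown conversion with noise filtering concrete, here is a toy sketch. It is not get-md's implementation or API: the function name `htmlToMarkdownSketch` is hypothetical, and it uses regular expressions only to stay self-contained, whereas the project description says the tool relies on parsing libraries. It handles just a handful of tags and assumes well-formed, non-nested input.

```typescript
// Toy converter for illustration only. NOT get-md's implementation or API.
// Assumes simple, well-formed HTML; a real converter should use an HTML parser.
function htmlToMarkdownSketch(html: string): string {
  return html
    // Noise filtering: drop elements that rarely carry article content.
    .replace(/<(script|style|nav|footer)\b[^>]*>[\s\S]*?<\/\1>/gi, "")
    // Headings: <h1>..<h6> become "#".."######" on their own block.
    .replace(
      /<h([1-6])\b[^>]*>([\s\S]*?)<\/h\1>/gi,
      (_match: string, level: string, text: string) =>
        `\n\n${"#".repeat(Number(level))} ${text.trim()}\n\n`
    )
    // Links with a double-quoted href become [text](url).
    .replace(/<a\b[^>]*href="([^"]*)"[^>]*>([\s\S]*?)<\/a>/gi, "[$2]($1)")
    // Bold and italic emphasis.
    .replace(/<(strong|b)\b[^>]*>([\s\S]*?)<\/\1>/gi, "**$2**")
    .replace(/<(em|i)\b[^>]*>([\s\S]*?)<\/\1>/gi, "*$2*")
    // A closing paragraph tag ends a block.
    .replace(/<\/p>/gi, "\n\n")
    // Strip any tags that remain.
    .replace(/<[^>]+>/g, "")
    // Collapse runs of blank lines and trim the edges.
    .replace(/\n{3,}/g, "\n\n")
    .trim();
}
```

Passing a small page with a `<nav>` bar, an `<h1>`, and a paragraph containing a link would return the heading and paragraph as Markdown, with the navigation text removed.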

Tags: LLM-data-preparation, web-scraping, AI-content-ingestion, data-cleaning, developer-tooling
No Package · No Dependents
Maintenance 10 / 25
Adoption 7 / 25
Maturity 13 / 25
Community 4 / 25

Stars: 32
Forks: 1
Language: TypeScript
License: (not shown)
Last pushed: Mar 03, 2026
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/Nano-Collective/get-md"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
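
The same request can be sketched in TypeScript. The base URL and path layout are copied from the curl example above; everything else is an assumption: the helper names `qualityUrl` and `fetchQuality` are hypothetical, the `llm-tools` segment is treated as a category only because it appears in that one URL, and the response is assumed to be JSON with an undocumented shape, so it is typed as `unknown`.

```typescript
// Base URL taken from the curl example in this page.
const API_BASE = "https://pt-edge.onrender.com/api/v1/quality";

// Build the endpoint URL for one repository.
// Assumption: the segment after /quality/ is a category; "llm-tools" is the
// only value seen in the source, so it is used as the default.
function qualityUrl(owner: string, repo: string, category: string = "llm-tools"): string {
  const parts = [category, owner, repo].map((part) => encodeURIComponent(part));
  return `${API_BASE}/${parts.join("/")}`;
}

// Fetch the quality data without an API key (the keyless tier described above).
// Assumption: the body is JSON; its fields are not documented here.
async function fetchQuality(owner: string, repo: string): Promise<unknown> {
  const res = await fetch(qualityUrl(owner, repo));
  if (!res.ok) {
    throw new Error(`Request failed with HTTP ${res.status}`);
  }
  return res.json();
}
```

Calling `fetchQuality("Nano-Collective", "get-md")` would request the same URL as the curl command. It relies on the global `fetch` available in Node 18+ and browsers.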