nirholas/extract-llms-docs
Extract documentation for AI agents from any site with llms.txt support. Features MCP server, REST API, batch processing, and multiple export formats.
This project helps AI agent builders and large language model (LLM) developers quickly get up-to-date, structured documentation from websites. It takes the URL of any website that follows the 'llms.txt' or 'install.md' standard and outputs organized, machine-readable documentation in formats like Markdown, JSON, or YAML. It is aimed at anyone building, training, or fine-tuning AI agents and LLMs who needs high-quality, current data.
Available on npm.
Use this if you need to reliably extract, organize, and prepare website documentation for use with AI agents, LLMs, or automated documentation pipelines.
Not ideal if you are looking to extract general content from any website, as it specifically targets sites using the 'llms.txt' and 'install.md' standards.
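To make the input side concrete, here is a minimal sketch of reading an llms.txt file, assuming the common convention that the file is served from the site root and lists resources as Markdown links under "## " headings. The names `LlmsLink`, `llmsTxtUrl`, and `parseLlmsTxt` are hypothetical and are not taken from extract-llms-docs; the project's own parser may work differently.

```typescript
// Hypothetical shape for one entry in an llms.txt file (illustration only).
interface LlmsLink {
  section: string;
  title: string;
  url: string;
  description: string;
}

// Assumption: llms.txt lives at the site root, so the path of siteUrl is ignored.
function llmsTxtUrl(siteUrl: string): string {
  return new URL("/llms.txt", siteUrl).toString();
}

// Collect "- [title](url): description" list items, grouped under the
// nearest preceding "## " heading. Other lines (H1, blockquote, prose) are skipped.
function parseLlmsTxt(text: string): LlmsLink[] {
  const links: LlmsLink[] = [];
  let section = "";
  for (const raw of text.split(/\r?\n/)) {
    const line = raw.trim();
    const heading = /^##\s+(.*)$/.exec(line);
    if (heading) {
      section = (heading[1] ?? "").trim();
      continue;
    }
    const item = /^[-*]\s+\[([^\]]+)\]\(([^)\s]+)\)(?::\s*(.*))?$/.exec(line);
    if (item) {
      links.push({
        section,
        title: item[1] ?? "",
        url: item[2] ?? "",
        description: item[3] ?? "", // the description after the colon is optional
      });
    }
  }
  return links;
}
```

The resulting array can be passed to `JSON.stringify` to get one of the machine-readable formats mentioned above; fetching the file and following each link is left out of the sketch.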
Stars: 14
Forks: 3
Language: TypeScript
License: MIT
Category:
Last pushed: Mar 03, 2026
Commits (30d): 0
Dependencies: 17
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/mcp/nirholas/extract-llms-docs"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
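The same request can be made from TypeScript. This is a rough sketch that assumes a runtime with a global `fetch` (for example Node 18 or later) and assumes the endpoint returns JSON; the response shape is not documented here, so it is typed as `unknown`. The base URL is copied from the curl command above, and the function names `qualityUrl` and `fetchQuality` are hypothetical.

```typescript
// Base URL copied from the curl command; everything else is illustrative.
const API_BASE = "https://pt-edge.onrender.com/api/v1/quality/mcp";

// Build the endpoint URL for a given owner/repo pair.
function qualityUrl(owner: string, repo: string): string {
  return `${API_BASE}/${encodeURIComponent(owner)}/${encodeURIComponent(repo)}`;
}

// Fetch the quality data. Assumption: the body is JSON of an unknown shape.
async function fetchQuality(owner: string, repo: string): Promise<unknown> {
  const res = await fetch(qualityUrl(owner, repo));
  if (!res.ok) {
    throw new Error(`Request failed: ${res.status} ${res.statusText}`);
  }
  return res.json();
}
```

A call such as `fetchQuality("nirholas", "extract-llms-docs")` would request the URL shown in the curl example; inspect the result before relying on any particular field.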
Higher-rated alternatives
thedaviddias/mcp-llms-txt-explorer
MCP to explore websites with llms.txt files
jonigl/ollama-mcp-bridge
Extend the Ollama API with dynamic AI tool integration from multiple MCP (Model Context...
sib-swiss/sparql-llm
🦜✨ Chat system, MCP server, and reusable components to improve LLMs capabilities when generating...
CodeLogicIncEngineering/codelogic-mcp-server
An MCP Server to utilize Codelogic's rich software dependency data in your AI programming assistant.
webworn/openfoam-mcp-server
LLM-powered OpenFOAM MCP server for intelligent CFD education with Socratic questioning and...