pgEdge/pgedge-docloader
A tool for converting HTML and RST docs into Markdown, and loading them into PostgreSQL.
This tool converts HTML, reStructuredText, and SGML documents into Markdown and loads the result into a PostgreSQL database. It accepts documentation files or Git repositories as input and stores the converted content in your database, ready for querying or further processing. It suits technical writers, content managers, and operations engineers who need to centralize documentation from diverse sources.
Use this if you need to consolidate documentation from different formats (HTML, RST, SGML) into a PostgreSQL database for unified access and management.
Not ideal if your primary need is simply to convert documents between formats without storing them in a database, or if your documents are in unsupported formats like PDF or Microsoft Word.
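To make the convert-then-load idea concrete, here is a minimal Go sketch of the conversion half for one narrow case: reStructuredText section titles underlined with `=` or `-`. It is an illustration under stated assumptions, not the tool's actual implementation. The `Document` struct, `rstToMarkdown`, `underlineLevel`, and `firstHeading` are hypothetical names invented for this example, and the real tool's schema and conversion rules are not described on this page.

```go
package main

import (
	"fmt"
	"strings"
)

// Document is a hypothetical row shape for a converted file.
// The real tool's table layout is not documented here.
type Document struct {
	Source   string // original file name
	Title    string // first level-1 heading, if any
	Markdown string // converted body
}

// underlineLevel reports the heading level implied by an RST underline:
// 1 for "=", 2 for "-", 0 if the pair is not a section title.
// Lengths are compared in bytes, which is enough for ASCII titles.
func underlineLevel(title, underline string) int {
	title = strings.TrimSpace(title)
	if title == "" || len(underline) < len(title) {
		return 0
	}
	switch {
	case strings.Trim(underline, "=") == "":
		return 1
	case strings.Trim(underline, "-") == "":
		return 2
	}
	return 0
}

// rstToMarkdown rewrites underlined RST titles as Markdown "#" headings
// and passes every other line through unchanged.
func rstToMarkdown(rst string) string {
	lines := strings.Split(rst, "\n")
	out := make([]string, 0, len(lines))
	for i := 0; i < len(lines); i++ {
		if i+1 < len(lines) {
			if level := underlineLevel(lines[i], lines[i+1]); level > 0 {
				out = append(out, strings.Repeat("#", level)+" "+strings.TrimSpace(lines[i]))
				i++ // skip the underline itself
				continue
			}
		}
		out = append(out, lines[i])
	}
	return strings.Join(out, "\n")
}

// firstHeading returns the text of the first level-1 Markdown heading.
func firstHeading(md string) string {
	for _, line := range strings.Split(md, "\n") {
		if strings.HasPrefix(line, "# ") {
			return strings.TrimPrefix(line, "# ")
		}
	}
	return ""
}

func main() {
	rst := "Install\n=======\n\nRun the loader.\n\nOptions\n-------\n"
	doc := Document{Source: "install.rst", Markdown: rstToMarkdown(rst)}
	doc.Title = firstHeading(doc.Markdown)
	fmt.Printf("%s (%s)\n%s", doc.Title, doc.Source, doc.Markdown)
}
```

In a fuller pipeline, each `Document` would then be written to PostgreSQL with a parameterized `INSERT` through Go's `database/sql` package and a PostgreSQL driver. That step is left out here so the sketch runs without a database.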
Stars
10
Forks
—
Language
Go
License
PostgreSQL
Category
Last pushed
Feb 03, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/rag/pgEdge/pgedge-docloader"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
any4ai/AnyCrawl
AnyCrawl: A Node.js/TypeScript crawler that turns websites into LLM-ready data and extracts...
kreuzberg-dev/html-to-markdown
High performance and CommonMark compliant HTML to Markdown converter. Maintained by the...
ScrapeGraphAI/Scrapegraph-ai
Python scraper based on AI
adbar/trafilatura
Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping,...
paulpierre/markdown-crawler
A multithreaded web crawler that recursively crawls a website and creates a markdown file...