hashangit/Extract2MD

Extract2MD is a powerful and versatile AI-enabled client-side JavaScript library for extracting text from PDF files and converting it into Markdown.

Score: 52 / 100 (Established)

Need to turn PDFs into clean, structured Markdown for your notes, documentation, or content? This tool takes your PDF files, whether they have selectable text or are scanned images, and converts them into well-formatted Markdown. It’s ideal for technical writers, researchers, or content creators who frequently work with documents and need to extract their content into a flexible, plain-text format for editing or publishing.

105 stars. Available on npm.

Use this if you need to reliably convert a wide variety of PDF documents, including those with complex layouts or scanned content, into structured Markdown, optionally enhanced by AI for better readability and organization.

Not ideal if you need a desktop application or a server-side solution, as this is designed for client-side web browser environments.

document-conversion technical-writing content-extraction research-notes knowledge-management
Maintenance 10 / 25
Adoption 9 / 25
Maturity 24 / 25
Community 9 / 25


Stars: 105
Forks: 6
Language: JavaScript
License: MIT
Last pushed: Feb 07, 2026
Commits (30d): 0
Dependencies: 3

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/hashangit/Extract2MD"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
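
The same request can be made from JavaScript. The following is a minimal sketch, assuming a runtime with a global `fetch` (a modern browser or Node.js 18+) and assuming the endpoint returns JSON. The helper names `buildQualityUrl` and `fetchQuality` are hypothetical and not part of any published client. The response fields are not documented here, so the sketch returns the parsed body as-is.

```javascript
// Base URL taken from the curl example; the path layout
// (category/owner/repo) is inferred from that single URL.
const API_BASE = "https://pt-edge.onrender.com/api/v1/quality";

// Hypothetical helper: builds the endpoint URL for one repository.
function buildQualityUrl(category, owner, repo) {
  const parts = [category, owner, repo].map(encodeURIComponent);
  return `${API_BASE}/${parts.join("/")}`;
}

// Hypothetical helper: requests the record and returns the parsed body.
// Assumes a JSON response; throws on any non-2xx status, which would
// include a rejection after the daily request limit is used up.
async function fetchQuality(category, owner, repo) {
  const response = await fetch(buildQualityUrl(category, owner, repo));
  if (!response.ok) {
    throw new Error(`Request failed with status ${response.status}`);
  }
  return response.json();
}
```

Calling `fetchQuality("llm-tools", "hashangit", "Extract2MD")` requests the same URL as the curl command. Keeping URL construction in its own function makes it possible to check the path without touching the network.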