Jing-yilin/E2M

E2M API, converting everything to markdown (LLM-friendly Format).

41
/ 100
Emerging

This tool helps AI developers and data scientists convert various unstructured documents and web content into clean, machine-readable Markdown or JSON formats. You input diverse file types like PDFs, web pages, or images, and it outputs structured text that large language models (LLMs) can easily process. This is ideal for anyone building AI knowledge bases or datasets who needs to standardize their input data.

139 stars. No commits in the last 6 months.

Use this if you need to transform raw, unstructured information from many sources into a consistent format suitable for AI applications and large language models.

Not ideal if you only need simple text extraction without the specific structuring for AI models or if you require advanced document analysis beyond format conversion.

AI data preparation LLM training data knowledge base construction document processing unstructured data conversion
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 15 / 25

How are scores calculated?

Stars

139

Forks

17

Language

Python

License

Apache-2.0

Last pushed

Dec 12, 2024

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/Jing-yilin/E2M"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.