vakharwalad23/mark-minion

The Ultimate Web Content Extraction & Conversion Tool for AI/LLM Applications. Convert almost any web content into clean Markdown with intelligent AI processing.

29 / 100 (Experimental)

Need to feed clean, structured content from various online sources into your AI models or applications? This tool takes almost any web content (web pages, documents, videos, social media posts, Google Docs) and converts it into clean Markdown or JSON. It filters out clutter such as ads, so the output is ready for tasks like content analysis or AI model training.

No commits in the last 6 months.

Use this if you are a data scientist, content strategist, or researcher who needs to systematically gather and clean diverse online content for AI model training, content analysis, or database population.

Not ideal if you need real-time, high-volume scraping without usage limits: the free plan may hit daily processing caps on browser-rendered sites.

content-curation data-acquisition ai-training-data web-research digital-librarian
Stale 6m, No Package, No Dependents
Maintenance 2 / 25
Adoption 5 / 25
Maturity 16 / 25
Community 6 / 25


Stars: 12
Forks: 1
Language: TypeScript
License: MIT
Last pushed: Oct 08, 2025
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/perception/vakharwalad23/mark-minion"

Open to everyone: 100 requests/day with no key needed, or 1,000 requests/day with a free key.
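
The same request can be made from code. The following is a minimal TypeScript sketch, assuming a runtime with a global `fetch` (for example Node 18+ or a browser). The helper names `qualityUrl` and `fetchQuality` are hypothetical and only for illustration. The response schema is not documented here, so the sketch returns the raw body as text rather than guessing at field names, and it does not show how an API key is passed, since that is not stated on this page.

```typescript
// Base path copied from the curl example on this page.
const BASE_URL = "https://pt-edge.onrender.com/api/v1/quality/perception";

// Build the endpoint URL for a repository given its owner and name.
// (Hypothetical helper; not part of any published client.)
function qualityUrl(owner: string, repo: string): string {
  return `${BASE_URL}/${encodeURIComponent(owner)}/${encodeURIComponent(repo)}`;
}

// Fetch the quality data for a repository and return the raw response body.
// The body is returned as text because its format is not specified here.
async function fetchQuality(owner: string, repo: string): Promise<string> {
  const response = await fetch(qualityUrl(owner, repo));
  if (!response.ok) {
    // Surface the HTTP status so callers can tell a limit or outage from success.
    throw new Error(`Request failed with HTTP status ${response.status}`);
  }
  return response.text();
}

// Example usage (performs a network request, so it is left commented out):
// fetchQuality("vakharwalad23", "mark-minion").then(console.log);
```

With the keyless limit of 100 requests/day, a caller polling many repositories would likely want to cache responses rather than refetch on every run.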