microsoft/markitdown
Python tool for converting files and office documents to Markdown.
MarkItDown helps data scientists, researchers, and AI developers prepare various document types for Large Language Models (LLMs). It takes common formats like PDFs, Word documents, PowerPoint presentations, or even YouTube URLs, and converts them into structured Markdown text. The output preserves key structural elements like headings and tables, making it ideal for text analysis pipelines and LLM ingestion.
90,677 stars. Used by 28 other packages. Actively maintained with 2 commits in the last 30 days. Available on PyPI.
Use this if you need to convert a wide range of file types into a structured, LLM-friendly Markdown format for text analysis or AI model input.
Not ideal if you need high-fidelity document conversions for human consumption where original formatting and visual layout are critical.
Stars
90,677
Forks
5,354
Language
Python
License
MIT
Category
Last pushed
Mar 10, 2026
Commits (30d)
2
Dependencies
6
Reverse dependents
28
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/microsoft/markitdown"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Recent Releases
Related tools
doocs/md
✍ WeChat Markdown Editor | 一款高度简洁的微信 Markdown 编辑器:支持 Markdown 语法、自定义主题样式、内容管理、多图床、AI 助手等特性
AIDotNet/OpenDeepWiki
OpenDeepWiki is the open-source version of the DeepWiki project, aiming to provide a powerful...
hyperfield/ai-file-sorter
Cross-platform desktop application for content-aware file organization and renaming. Supports...
drl990114/MarkFlowy
The AI Markdown Editor
shoryasethia/markdrop
A Python package for converting PDFs to markdown while extracting images and tables, generate...