andreshere00/Splitter_MR

Chunk your data into markdown text blocks for your LLM applications

44
/ 100
Emerging

This tool helps developers working with large language models (LLMs) to prepare various types of data for their applications. It takes in diverse file formats like text, PDFs, Office documents, JSON, or images, processes them, and outputs organized chunks of text in Markdown format. This is ideal for developers building LLM-powered applications who need to efficiently manage and segment source data.

Available on PyPI.

Use this if you need to reliably break down unstructured data from many file types into manageable, semantically coherent text blocks for your LLM applications.

Not ideal if you only work with small, pre-formatted text segments or do not develop applications using large language models.

LLM development data preprocessing document parsing text chunking AI application development
Maintenance 6 / 25
Adoption 7 / 25
Maturity 24 / 25
Community 7 / 25

How are scores calculated?

Stars

25

Forks

2

Language

Jupyter Notebook

License

MIT

Last pushed

Jan 08, 2026

Commits (30d)

0

Dependencies

18

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/rag/andreshere00/Splitter_MR"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.