chu2bard/chunkflow

Document chunking pipeline for RAG applications

26 / 100 (Experimental)

This tool helps developers of AI applications prepare long text documents for retrieval-augmented generation (RAG) systems. It takes a lengthy document, such as a report or article, and splits it into smaller, manageable pieces (chunks) using one of several strategies.
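To make the idea of chunking concrete, here is a minimal sketch of one common strategy: fixed-size character windows with overlap. It is not chunkflow's API; the function name `chunk_text` and its parameters are hypothetical, chosen only for illustration, and the project may implement its strategies quite differently.

```python
def chunk_text(text: str, chunk_size: int = 200, overlap: int = 50) -> list[str]:
    """Split text into character windows of at most chunk_size characters.

    Consecutive chunks share `overlap` characters, so a sentence that is cut
    at a boundary still appears intact in one of the neighbouring chunks.
    Hypothetical helper for illustration; not part of chunkflow.
    """
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    if not 0 <= overlap < chunk_size:
        raise ValueError("overlap must satisfy 0 <= overlap < chunk_size")

    step = chunk_size - overlap  # how far the window advances each time
    chunks = []
    for start in range(0, len(text), step):
        chunks.append(text[start:start + chunk_size])
        # Stop once the current window already reaches the end of the text.
        if start + chunk_size >= len(text):
            break
    return chunks


if __name__ == "__main__":
    for piece in chunk_text("abcdefghij", chunk_size=4, overlap=1):
        print(piece)
```

Character windows are the simplest option. Splitting on sentence or paragraph boundaries usually yields chunks that retrieve better, at the cost of uneven chunk lengths.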

Use this if you are a developer building a RAG application and need flexible, robust ways to split your source documents into suitable chunks.

Not ideal if you are an end-user simply looking to summarize or analyze a document without developing a RAG system.

AI-application-development RAG-system-building text-pre-processing natural-language-processing document-preparation
No Package · No Dependents
Maintenance 10 / 25
Adoption 5 / 25
Maturity 11 / 25
Community 0 / 25


Stars: 13
Forks:
Language: Python
License: MIT
Last pushed: Feb 11, 2026
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/embeddings/chu2bard/chunkflow"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
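The same request can be sketched in Python using only the standard library. The URL layout is taken from the curl command above; the helper names `quality_url` and `fetch_quality` are hypothetical, and the assumption that the endpoint returns a JSON body is not confirmed by this page.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path copied from the curl example; the "embeddings" segment is kept as-is.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality/embeddings"


def quality_url(owner: str, repo: str) -> str:
    """Build the endpoint URL for a repository (hypothetical helper)."""
    return f"{BASE_URL}/{quote(owner, safe='')}/{quote(repo, safe='')}"


def fetch_quality(owner: str, repo: str, timeout: float = 10.0):
    """Fetch the quality record for a repository.

    Assumes the response body is JSON; adjust the parsing if it is not.
    """
    with urllib.request.urlopen(quality_url(owner, repo), timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


if __name__ == "__main__":
    # Performs a real network call and counts against the daily request limit.
    print(fetch_quality("chu2bard", "chunkflow"))
```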