chonkie and RAG-chunker

Both projects split and prepare documents for RAG pipelines, so they compete in the document-chunking space. Chonkie is a mature, production-ready library, while RAG-chunker appears to be an early-stage project with minimal adoption.

chonkie
Score: 80 (Verified)
Maintenance 22/25
Adoption 15/25
Maturity 25/25
Community 18/25
Stars: 3,829
Forks: 256
Downloads:
Commits (30d): 82
Language: Python
License: MIT
No risk flags

RAG-chunker
Score: 31 (Emerging)
Maintenance 2/25
Adoption 2/25
Maturity 15/25
Community 12/25
Stars: 2
Forks: 1
Downloads:
Commits (30d): 0
Language: Python
License: Apache-2.0
Risk flags: Stale 6m, No Package, No Dependents

About chonkie

chonkie-inc/chonkie

🦛 CHONK docs with Chonkie ✨ — The lightweight ingestion library for fast, efficient and robust RAG pipelines

This is a lightweight tool for developers building Retrieval-Augmented Generation (RAG) applications. It takes various forms of text data, processes it by intelligently splitting it into smaller, meaningful parts (chunks), and then refines and embeds these chunks. The output is optimized text chunks ready to be stored in a vector database for efficient retrieval by large language models.
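To make the splitting step concrete, here is a minimal standard-library sketch of fixed-size chunking with overlap. It does not use Chonkie's API: the names `Chunk` and `chunk_words` and the parameters `chunk_size` and `overlap` are hypothetical and chosen only for illustration. The sketch counts whitespace-separated words rather than tokens, and it leaves out the refining and embedding steps.

```python
from dataclasses import dataclass


@dataclass
class Chunk:
    """A slice of the source text (hypothetical type, not Chonkie's)."""
    text: str
    start_word: int  # index of the chunk's first word in the source
    end_word: int    # index one past the chunk's last word


def chunk_words(text: str, chunk_size: int = 50, overlap: int = 10) -> list[Chunk]:
    """Split text into word-based chunks that share `overlap` words."""
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    if not 0 <= overlap < chunk_size:
        raise ValueError("overlap must be in [0, chunk_size)")

    words = text.split()
    step = chunk_size - overlap  # how far each new chunk advances
    chunks: list[Chunk] = []
    for start in range(0, len(words), step):
        end = min(start + chunk_size, len(words))
        chunks.append(Chunk(" ".join(words[start:end]), start, end))
        if end == len(words):
            break  # the last word is covered; avoid a redundant tail chunk
    return chunks


if __name__ == "__main__":
    sample = "Chunking splits long documents into smaller overlapping pieces for retrieval"
    for chunk in chunk_words(sample, chunk_size=4, overlap=1):
        print(chunk.start_word, chunk.end_word, chunk.text)
```

The overlap keeps some context from the end of one chunk at the start of the next, which can help when a relevant sentence straddles a boundary. A production pipeline would typically count tokens with the embedding model's tokenizer instead of words.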

RAG development, LLM application development, text preprocessing, vector database integration, AI application engineering

About RAG-chunker

AceAtDev/RAG-chunker

The easiest and most effective tool to retrain a RAG LLM/Gen AI/agent on your data

Scores updated daily from GitHub, PyPI, and npm data.