jchunk-io/jchunk
JChunk is a lightweight and flexible library designed to provide multiple strategies for text chunking within Java applications.
It helps developers break large blocks of text into smaller, manageable pieces: it takes raw text as input and returns segmented text chunks, a step that applications need in order to process or search through text efficiently. It is designed for Java developers building RAG (Retrieval-Augmented Generation) applications or any system that requires text segmentation.
Use this if you are a Java developer building an application that needs to efficiently split large text documents into smaller chunks for tasks like search or AI model input.
Not ideal if you work outside Java, or if your text processing does not call for the advanced chunking strategies used in RAG-like applications.
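To make "raw text in, chunks out" concrete, here is a minimal sketch of one common strategy: fixed-size chunks with a small overlap. It is not JChunk's API. The class name `FixedSizeChunkSketch`, the method `chunk`, and the parameters `chunkSize` and `overlap` are hypothetical names chosen for illustration.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration of fixed-size chunking with overlap.
// This is NOT JChunk's API; all names here are assumptions.
class FixedSizeChunkSketch {

    // Splits text into pieces of at most chunkSize chars, where each piece
    // repeats the last `overlap` chars of the previous one.
    // Note: this counts Java chars, so it may split a surrogate pair.
    static List<String> chunk(String text, int chunkSize, int overlap) {
        if (chunkSize <= 0) {
            throw new IllegalArgumentException("chunkSize must be positive");
        }
        if (overlap < 0 || overlap >= chunkSize) {
            throw new IllegalArgumentException("overlap must be in [0, chunkSize)");
        }
        List<String> chunks = new ArrayList<>();
        int step = chunkSize - overlap; // how far the window advances each time
        for (int start = 0; start < text.length(); start += step) {
            int end = Math.min(start + chunkSize, text.length());
            chunks.add(text.substring(start, end));
            if (end == text.length()) {
                break; // the final chunk reached the end of the text
            }
        }
        return chunks;
    }

    public static void main(String[] args) {
        String text = "The quick brown fox jumps over the lazy dog";
        for (String piece : chunk(text, 16, 4)) {
            System.out.println("[" + piece + "]");
        }
    }
}
```

The overlap keeps a little shared context between neighbouring chunks, which can help when a sentence straddles a boundary. Splitting on sentences or on meaning needs more logic than this sketch shows.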
Stars: 17
Forks: 4
Language: Java
License: Apache-2.0
Last pushed: Mar 03, 2026
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/rag/jchunk-io/jchunk"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
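For Java callers, the same request can be sketched with the JDK's built-in `java.net.http.HttpClient` (Java 11+). The URL is copied from the curl line above. The class name `QualityApiSketch` and the helper `buildRequest` are hypothetical. The response format is not documented here, so the sketch only prints the status code and the raw body.

```java
import java.net.URI;
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;

// Hypothetical wrapper around the endpoint shown in the curl example.
class QualityApiSketch {

    // URL taken verbatim from the curl command; no API key is sent.
    static final String ENDPOINT =
            "https://pt-edge.onrender.com/api/v1/quality/rag/jchunk-io/jchunk";

    // Builds a plain GET request for the endpoint.
    static HttpRequest buildRequest() {
        return HttpRequest.newBuilder(URI.create(ENDPOINT))
                .GET()
                .build();
    }

    public static void main(String[] args) throws Exception {
        HttpClient client = HttpClient.newHttpClient();
        HttpResponse<String> response =
                client.send(buildRequest(), HttpResponse.BodyHandlers.ofString());
        // The response shape is an unknown here, so print it unparsed.
        System.out.println("HTTP " + response.statusCode());
        System.out.println(response.body());
    }
}
```

How a key is passed for the higher limit is not stated here, so the sketch sends none.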
Higher-rated alternatives
chonkie-inc/chonkie
🦛 CHONK docs with Chonkie ✨ — The lightweight ingestion library for fast, efficient and robust...
speedyk-005/chunklet-py
One library to split them all: Sentence, Code, Docs. Chunk smarter, not harder — built for LLMs,...
andreshere00/Splitter_MR
Chunk your data into markdown text blocks for your LLM applications
chonkie-inc/chonkiejs
🦛 CHONK your texts with Chonkie ✨ Type-friendly, light-weight, fast and super-simple chunking library
thom-heinrich/chonkify
Extractive document compression for RAG and agent pipelines. +69% vs LLMLingua, +175% vs...