Semantic Chunking Embedding Tools
Tools for splitting text/code into semantically coherent chunks using embeddings, AST analysis, or similarity metrics for LLM processing. Does NOT include general tokenization, sentence splitting, or document parsing without semantic awareness.
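To make the category concrete, here is a minimal sketch of similarity-based chunking: adjacent sentences stay in the same chunk while their vectors remain similar, and a new chunk starts when similarity drops. It is not the algorithm of any tool listed here. The `embed` function is a toy bag-of-words stand-in for a real embedding model, and the names `embed`, `cosine`, `semantic_chunks`, and the `threshold` value are all illustrative assumptions.

```python
import math
import re
from collections import Counter


def embed(sentence):
    """Toy stand-in for an embedding model: a bag-of-words count vector."""
    return Counter(re.findall(r"[a-z0-9]+", sentence.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def semantic_chunks(sentences, threshold=0.2):
    """Group consecutive sentences; start a new chunk when similarity
    to the previous sentence falls below `threshold` (assumed value)."""
    chunks = []
    for sentence in sentences:
        if chunks and cosine(embed(chunks[-1][-1]), embed(sentence)) >= threshold:
            chunks[-1].append(sentence)
        else:
            chunks.append([sentence])
    return [" ".join(chunk) for chunk in chunks]
```

A real tool would swap `embed` for a learned embedding model and often compares against a window of sentences rather than only the previous one; the threshold-on-adjacent-similarity idea is the part this sketch is meant to show.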
There are 14 semantic chunking tools tracked. One scores above 50 (established tier). The highest-rated is jparkerweb/semantic-chunking at 59/100 with 134 stars.
Get all 14 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=embeddings&subcategory=semantic-chunking&limit=20"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
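The same request can be made from Python using only the standard library. This is a sketch under stated assumptions: the endpoint and query parameters are copied from the curl command above, the helper names `build_url` and `fetch_projects` are hypothetical, and nothing is assumed about the shape of the returned JSON.

```python
import json
import urllib.parse
import urllib.request

# Endpoint taken from the curl command above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(domain="embeddings", subcategory="semantic-chunking", limit=20):
    """Assemble the same query string used in the curl command."""
    query = urllib.parse.urlencode(
        {"domain": domain, "subcategory": subcategory, "limit": limit}
    )
    return f"{BASE_URL}?{query}"


def fetch_projects(url, timeout=10):
    """Download and decode the JSON body; the response schema is not assumed."""
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.load(response)


# Example usage (performs a network request, so it is left commented out):
# data = fetch_projects(build_url())
# print(json.dumps(data, indent=2)[:500])
```

Inspecting the first few hundred characters of the decoded response is a reasonable way to learn the field names before writing any code that depends on them.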
| # | Tool | Score | Tier |
|---|---|---|---|
| 1 | jparkerweb/semantic-chunking: 🍱 semantic-chunking ⇢ semantically create chunks from large document for... | 59 | Established |
| 2 | drittich/SemanticSlicer: 🧠✂️ SemanticSlicer — A smart text chunker for LLM-ready documents. | | Emerging |
| 3 | smart-models/Normalized-Semantic-Chunker: Cutting-edge tool that unlocks the full potential of semantic chunking | | Emerging |
| 4 | ndgigliotti/afterthoughts: Sentence-aware embeddings using late chunking with transformers. | | Emerging |
| 5 | ReemHal/Semantic-Text-Segmentation-with-Embeddings: Uses GloVe embeddings and greedy sequence segmentation to semantically... | | Emerging |
| 6 | agamm/semantic-split: A Python library to chunk/group your texts based on semantic similarity. | | Experimental |
| 7 | chu2bard/chunkflow: Document chunking pipeline for RAG applications | | Experimental |
| 8 | danielefrisanco/semantic_chunker: A lightweight Ruby library for splitting text into topically coherent chunks... | | Experimental |
| 9 | zoobz-io/chisel: AST-aware code chunking for semantic search and embeddings | | Experimental |
| 10 | njyeung/go-semantic-chunking: Semantic chunking algorithm in (mostly) Go | | Experimental |
| 11 | SainathPattipati/advanced-chunking-strategies: Semantic, agentic, and contextual chunking strategies for RAG with... | | Experimental |
| 12 | geleto/semachunk: Lightweight Semantic Chunking Library. Plug any embedding provider/API.... | | Experimental |
| 13 | do-me/js-text-chunker: A simple vanilla JS text chunker for hierarchical semantic chunking | | Experimental |
| 14 | gazelle93/Various-Chunking-Methods: Exploring and benchmarking chunking methods for Retrieval-Augmented... | | Experimental |