jackfsuia/bert-chunker
bert-chunker: efficient and trained chunking for unstructured documents. Train BERT to segment documents.
36 / 100
Emerging
No Package
No Dependents
Maintenance: 6 / 25
Adoption: 4 / 25
Maturity: 16 / 25
Community: 10 / 25
Stars: 6
Forks: 1
Language: Python
License: MIT
Category
Last pushed: Jan 03, 2026
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/jackfsuia/bert-chunker"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
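The same request can be made from Python. This is a minimal sketch using only the standard library. The helper names `quality_url` and `fetch_quality` are hypothetical, the `<category>/<owner>/<repo>` path layout is inferred from the single URL shown above, and the JSON response format is an assumption, since the page does not document it.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example; the layout below it is an assumption.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, repo: str) -> str:
    """Build the endpoint URL for a repo given as 'owner/name'."""
    # Keep the owner/name slash intact and percent-escape anything unusual.
    return f"{BASE_URL}/{quote(category)}/{quote(repo, safe='/')}"


def fetch_quality(category: str, repo: str, timeout: float = 10.0):
    """Fetch the quality record; assumes the endpoint returns a JSON body."""
    with urllib.request.urlopen(quality_url(category, repo), timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


# Example call (performs a network request, counts toward the daily limit):
# data = fetch_quality("nlp", "jackfsuia/bert-chunker")
```

Building the URL separately from the fetch makes it possible to check the request target without spending any of the daily request allowance.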
Higher-rated alternatives
mirth/chonky
Fully neural approach for text chunking
51
sentencizer/sentencizer
A sentence splitting (sentence boundary disambiguation) library for Go. It is rule-based and...
42
prajwal10001/semantic-chunker-langchain
Token-aware, LangChain-compatible semantic chunker with PDF, markdown, and layout support
22
bgokden/fast-text-splitter
Fast text splitter with ONNX
21