lionsoul2014/jcseg
Jcseg is a light weight NLP framework developed with Java. Provide CJK and English segmentation based on MMSEG algorithm, With also keywords extraction, key sentence extraction, summary extraction implemented based on TEXTRANK algorithm. Jcseg had a build-in http server and search modules for lucene,solr,elasticsearch,opensearch
This tool helps anyone working with Chinese text process it effectively for search, analysis, or content generation. It takes raw Chinese text and breaks it down into individual words or phrases, and can also identify important keywords, key sentences, or even create automatic summaries. It's designed for content analysts, researchers, or anyone building applications that need to understand or extract meaning from large volumes of Chinese documents.
920 stars. No commits in the last 6 months.
Use this if you need to precisely segment Chinese text, identify key information within it, or generate concise summaries for applications like search engines, content recommendation, or data analysis.
Not ideal if your primary need is highly accurate automatic part-of-speech tagging, as this feature is still under development.
Stars
920
Forks
211
Language
Java
License
Apache-2.0
Category
Last pushed
Sep 18, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/lionsoul2014/jcseg"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
PyThaiNLP/pythainlp
Thai natural language processing in Python
hankcs/HanLP
Natural Language Processing for the next decade. Tokenization, Part-of-Speech Tagging, Named...
jacksonllee/pycantonese
Cantonese Linguistics and NLP
dongrixinyu/JioNLP
中文 NLP 预处理、解析工具包,准确、高效、易用 A Chinese NLP Preprocessing & Parsing Package www.jionlp.com
hankcs/pyhanlp
中文分词