hankcs/hanlp-lucene-plugin
HanLP中文分词Lucene插件,支持包括Solr在内的基于Lucene的系统
This tool helps improve search accuracy for Chinese text in systems like Solr. It takes raw Chinese text and breaks it down into individual words and phrases, which are essential for effective search and retrieval. Content managers, data analysts, or anyone managing large volumes of Chinese text data would use this.
299 stars. No commits in the last 6 months.
Use this if you need accurate and flexible Chinese word segmentation for search engines or information retrieval systems built on Lucene or Solr.
Not ideal if your system does not use Lucene or Solr, or if you primarily work with languages other than Chinese.
Stars
299
Forks
100
Language
Java
License
Apache-2.0
Category
Last pushed
Oct 13, 2020
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/hankcs/hanlp-lucene-plugin"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
PyThaiNLP/pythainlp
Thai natural language processing in Python
hankcs/HanLP
Natural Language Processing for the next decade. Tokenization, Part-of-Speech Tagging, Named...
jacksonllee/pycantonese
Cantonese Linguistics and NLP
dongrixinyu/JioNLP
中文 NLP 预处理、解析工具包,准确、高效、易用 A Chinese NLP Preprocessing & Parsing Package www.jionlp.com
hankcs/pyhanlp
中文分词