notoriouslab/trad-zh-search
trad-zh-search 可單獨搭配主流搜尋引擎,專門給繁體中文使用的繁體中文文本預處理工具 —— CKIP 分詞 + bigram 索引生成,附可選擇的領域字典系統
Building a search engine for Traditional Chinese content often leads to frustratingly inaccurate results because mainstream search tools struggle with proper word segmentation. This tool takes your Traditional Chinese text and processes it using advanced natural language processing, like CKIP segmentation and bigram indexing, to produce highly accurate search results. It's ideal for anyone managing a knowledge base, e-commerce site, or any platform with a large volume of Traditional Chinese text who needs precise search functionality.
Available on PyPI.
Use this if you manage a Traditional Chinese content platform and find that your current search engine frequently fails to return relevant results due to poor word recognition.
Not ideal if your content is primarily in Simplified Chinese or other languages, or if you only need a very basic keyword search without concern for linguistic precision.
Stars
15
Forks
—
Language
Python
License
MIT
Category
Last pushed
Mar 23, 2026
Commits (30d)
0
Dependencies
1
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/notoriouslab/trad-zh-search"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
PyThaiNLP/pythainlp
Thai natural language processing in Python
hankcs/HanLP
Natural Language Processing for the next decade. Tokenization, Part-of-Speech Tagging, Named...
jacksonllee/pycantonese
Cantonese Linguistics and NLP
dongrixinyu/JioNLP
中文 NLP 预处理、解析工具包,准确、高效、易用 A Chinese NLP Preprocessing & Parsing Package www.jionlp.com
hankcs/pyhanlp
中文分词