KoichiYasuoka/UD-Kanbun
Tokenizer, POS tagger, and dependency parser for Classical Chinese
This tool helps researchers and students of Classical Chinese (漢文/文言文) by automatically segmenting texts into individual words, identifying each word's part of speech, and mapping the grammatical relationships between words. You input a Classical Chinese text, and it outputs a detailed linguistic analysis, including glosses and dependency trees. It is aimed at anyone studying or analyzing ancient Chinese texts for academic or research purposes.
Use this if you need to perform detailed linguistic analysis, such as part-of-speech tagging or dependency parsing, on Classical Chinese texts to understand their grammatical structure.
Not ideal if you are looking for translation, stylistic analysis, or sentiment analysis, or if your texts are in modern Chinese.
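To make the kind of analysis described above concrete, here is a minimal sketch of reading a dependency parse in CoNLL-U, the standard Universal Dependencies interchange format. It assumes the analysis is serialized as CoNLL-U. The sample rows for 不入虎穴 are hand-written for illustration and are not output captured from the tool, and the names Token, parse_conllu, and describe are hypothetical helpers, not part of UD-Kanbun.

```python
from dataclasses import dataclass

# Hand-written illustrative CoNLL-U rows for 不入虎穴.
# This is NOT output captured from UD-Kanbun; it only shows the format.
SAMPLE = "\n".join([
    "1\t不\t不\tADV\t_\t_\t2\tadvmod\t_\t_",
    "2\t入\t入\tVERB\t_\t_\t0\troot\t_\t_",
    "3\t虎\t虎\tNOUN\t_\t_\t4\tnmod\t_\t_",
    "4\t穴\t穴\tNOUN\t_\t_\t2\tobj\t_\t_",
])


@dataclass
class Token:
    id: int       # 1-based position in the sentence
    form: str     # surface form of the word
    upos: str     # universal part-of-speech tag
    head: int     # id of the governing word, 0 for the root
    deprel: str   # dependency relation to the head


def parse_conllu(text):
    """Parse the word rows of one CoNLL-U sentence into Token objects."""
    tokens = []
    for line in text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue  # skip blank lines and comment lines
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip range rows (e.g. "1-2") and malformed rows
        tokens.append(Token(int(cols[0]), cols[1], cols[3], int(cols[6]), cols[7]))
    return tokens


def describe(tokens):
    """Render each word with its tag, relation, and the form of its head."""
    by_id = {t.id: t for t in tokens}
    lines = []
    for t in tokens:
        head = "ROOT" if t.head == 0 else by_id[t.head].form
        lines.append(f"{t.form} ({t.upos}) --{t.deprel}--> {head}")
    return lines


tokens = parse_conllu(SAMPLE)
for line in describe(tokens):
    print(line)
```

Each printed line pairs a word with its part of speech and the word it depends on, which is the same information a dependency tree conveys graphically.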
Stars: 71
Forks: 8
Language: Python
License: MIT
Category:
Last pushed: Feb 23, 2026
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/KoichiYasuoka/UD-Kanbun"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
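The same request can be made from Python with only the standard library. This is a sketch under stated assumptions: the URL shape is taken from the curl command above, reading the "nlp" path segment as a category is a guess based on that shape, and the response format is not documented here, so the body is returned as raw text. The helper names quality_url and fetch_quality are hypothetical.

```python
import urllib.request
from urllib.parse import quote

# Base path copied from the curl example; everything after it is
# assumed to be <category>/<owner>/<repo>.
BASE = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category, repo):
    """Build the endpoint URL for a repository given as 'owner/name'."""
    return f"{BASE}/{quote(category, safe='')}/{quote(repo, safe='/')}"


def fetch_quality(category, repo, timeout=10.0):
    """Fetch the endpoint and return the response body as text.

    The response schema is an unknown here, so no parsing is attempted.
    """
    with urllib.request.urlopen(quality_url(category, repo), timeout=timeout) as resp:
        return resp.read().decode("utf-8")


url = quality_url("nlp", "KoichiYasuoka/UD-Kanbun")
print(url)
# body = fetch_quality("nlp", "KoichiYasuoka/UD-Kanbun")  # performs a network call
```

The network call is left commented out so the sketch runs offline; with the keyless limit of 100 requests/day, caching the returned text locally is a sensible precaution.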
Higher-rated alternatives
PyThaiNLP/pythainlp
Thai natural language processing in Python
hankcs/HanLP
Natural Language Processing for the next decade. Tokenization, Part-of-Speech Tagging, Named...
jacksonllee/pycantonese
Cantonese Linguistics and NLP
dongrixinyu/JioNLP
A Chinese NLP preprocessing and parsing toolkit: accurate, efficient, and easy to use. www.jionlp.com
hankcs/pyhanlp
Chinese word segmentation