yaoguangluo/Deta_Parser
Fast Chinese word segmentation and analysis
This tool helps you break down Chinese and mixed-language texts into individual words, a crucial first step for many text analysis tasks. It takes raw text as input and outputs segmented words, along with their grammatical roles (parts of speech). This is designed for researchers, analysts, or anyone working with large volumes of Chinese or multilingual text who needs to accurately identify words for deeper linguistic analysis, sentiment analysis, or information extraction.
477 stars. No commits in the last 6 months.
Use this if you need extremely fast and accurate word segmentation and part-of-speech tagging for Chinese and other languages, even with complex or grammatically challenging sentences.
Not ideal if your primary need is general-purpose, high-quality machine translation, as it focuses specifically on segmentation and tagging rather than full sentence translation.
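To make the input and output described above concrete, here is a minimal sketch of dictionary-based segmentation with part-of-speech tags, using forward maximum matching. It is a generic textbook technique chosen for illustration, not Deta_Parser's algorithm or API. The class name `MaxMatchSketch`, the tiny lexicon, and the tag names are all hypothetical.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Illustrative only: a toy forward-maximum-matching segmenter.
// NOT Deta_Parser's API or algorithm; the lexicon and tags are made up.
class MaxMatchSketch {
    // Hypothetical mini-lexicon: word -> part-of-speech tag.
    static final Map<String, String> LEXICON = new HashMap<>();
    static {
        LEXICON.put("我", "pronoun");
        LEXICON.put("喜欢", "verb");
        LEXICON.put("自然", "noun");
        LEXICON.put("语言", "noun");
        LEXICON.put("自然语言", "noun");
        LEXICON.put("处理", "verb");
    }

    // Longest lexicon entry, in characters.
    static final int MAX_WORD_LEN = 4;

    // Returns tokens formatted as "word/tag", scanning left to right and
    // always taking the longest lexicon entry that starts at the cursor.
    // Assumes characters in the Basic Multilingual Plane (one char each).
    static List<String> segment(String text) {
        List<String> tokens = new ArrayList<>();
        int i = 0;
        while (i < text.length()) {
            int end = Math.min(text.length(), i + MAX_WORD_LEN);
            String match = null;
            for (int j = end; j > i; j--) {
                String candidate = text.substring(i, j);
                if (LEXICON.containsKey(candidate)) {
                    match = candidate;
                    break;
                }
            }
            if (match == null) {
                // No lexicon entry starts here: emit a single character.
                match = text.substring(i, i + 1);
                tokens.add(match + "/unknown");
            } else {
                tokens.add(match + "/" + LEXICON.get(match));
            }
            i += match.length();
        }
        return tokens;
    }

    public static void main(String[] args) {
        System.out.println(String.join(" ", segment("我喜欢自然语言处理")));
    }
}
```

Because the longest match wins, "自然语言" is kept as one word rather than split into "自然" and "语言". Real segmenters typically add statistical or rule-based disambiguation on top of a far larger dictionary.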
Stars: 477
Forks: 87
Language: Java
License: GPL-2.0
Last pushed: Oct 01, 2025
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/yaoguangluo/Deta_Parser"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
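For callers working in Java rather than the shell, the same request might look like the sketch below, which uses the standard `java.net.http` client (Java 11+). The class name `QualityApiSketch` is hypothetical. The response format is not documented here, so the body is printed as raw text rather than parsed.

```java
import java.io.IOException;
import java.net.URI;
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;
import java.time.Duration;

// Illustrative sketch: the same GET request as the curl command above.
class QualityApiSketch {
    // Endpoint copied verbatim from the listing.
    static final String ENDPOINT =
        "https://pt-edge.onrender.com/api/v1/quality/nlp/yaoguangluo/Deta_Parser";

    // Builds the keyless GET request; the 10-second timeout is an arbitrary choice.
    static HttpRequest buildRequest() {
        return HttpRequest.newBuilder(URI.create(ENDPOINT))
            .timeout(Duration.ofSeconds(10))
            .GET()
            .build();
    }

    public static void main(String[] args) {
        HttpClient client = HttpClient.newHttpClient();
        try {
            HttpResponse<String> response =
                client.send(buildRequest(), HttpResponse.BodyHandlers.ofString());
            // The response schema is unknown here, so print it unparsed.
            System.out.println("HTTP " + response.statusCode());
            System.out.println(response.body());
        } catch (IOException e) {
            System.err.println("Request failed: " + e.getMessage());
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            System.err.println("Request interrupted");
        }
    }
}
```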
Related tools
PyThaiNLP/pythainlp
Thai natural language processing in Python
hankcs/HanLP
Natural Language Processing for the next decade. Tokenization, Part-of-Speech Tagging, Named...
jacksonllee/pycantonese
Cantonese Linguistics and NLP
dongrixinyu/JioNLP
A Chinese NLP preprocessing and parsing package: accurate, efficient, and easy to use. www.jionlp.com
hankcs/pyhanlp
Chinese word segmentation