hankcs/multi-criteria-cws

Simple Solution for Multi-Criteria Chinese Word Segmentation

Score: 49 / 100 (Emerging)

This tool helps researchers and computational linguists segment Chinese text into words, a prerequisite for many natural language processing tasks because written Chinese does not mark word boundaries with spaces. It takes raw Chinese text or pre-existing corpora as input and outputs segmented text, ready for further analysis or model training.

303 stars. No commits in the last 6 months.

Use this if you need to perform high-quality Chinese word segmentation for research, academic projects, or building NLP applications.

Not ideal if you need a simple, ready-to-use API for Chinese word segmentation without any setup, or if you don't have access to relevant Chinese corpora.

Chinese NLP, computational linguistics, text segmentation, corpus annotation, language research
Stale 6m, No Package, No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 23 / 25
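
The four category scores listed above add up to the overall score of 49 out of 100. A minimal check in Python, assuming the overall score is the unweighted sum of the categories (this page does not state the formula, so treat that as an assumption):

```python
# Category scores as listed on this page (each is out of 25).
category_scores = {
    "Maintenance": 0,
    "Adoption": 10,
    "Maturity": 16,
    "Community": 23,
}

# Assumption: the overall score is the plain sum of the four categories.
overall = sum(category_scores.values())
max_overall = 25 * len(category_scores)

print(f"{overall} / {max_overall}")
```

Under that assumption, the zero Maintenance score (no recent commits) accounts for 25 of the 51 missing points.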


Stars: 303
Forks: 81
Language: Python
License: GPL-3.0
Last pushed: Aug 12, 2020
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/hankcs/multi-criteria-cws"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
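
For scripted use, the same request as the curl command above can be sketched with Python's standard library. The helper names `quality_url` and `fetch_quality` are hypothetical, the `category/owner/repo` path pattern is inferred from the single URL shown here, and the JSON response format is an assumption, since this page does not document the response:

```python
import json
import urllib.request

# Base path taken from the curl example on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL (hypothetical helper; the path pattern
    is inferred from the one example URL on this page)."""
    return f"{BASE_URL}/{category}/{owner}/{repo}"


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the quality record for a repository.

    Assumption: the response body is JSON. The page does not say so.
    """
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

Calling `fetch_quality("nlp", "hankcs", "multi-criteria-cws")` would issue the request shown in the curl command. With the keyless limit of 100 requests/day, it is worth caching results rather than polling.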