jacksonllee/pycantonese

Cantonese Linguistics and NLP

64
/ 100
Established

PyCantonese helps linguists, researchers, or anyone interested in the Cantonese language to analyze Cantonese text. You can input raw Cantonese text or corpus data and it provides tools for word segmentation, part-of-speech tagging, and converting between different romanization systems like Jyutping. This is ideal for academics and commercial organizations studying or processing Cantonese.

400 stars. Available on PyPI.

Use this if you need to programmatically analyze Cantonese text for linguistic research, build language-learning tools, or process Cantonese data for natural language understanding applications.

Not ideal if you need a pre-built application for end-users rather than a programmatic library for text analysis.

Cantonese-linguistics NLP text-analysis language-research Jyutping
Maintenance 13 / 25
Adoption 10 / 25
Maturity 25 / 25
Community 16 / 25

How are scores calculated?

Stars

400

Forks

43

Language

Python

License

MIT

Last pushed

Mar 15, 2026

Commits (30d)

0

Dependencies

1

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/jacksonllee/pycantonese"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.