CanCLID/awesome-cantonese-nlp
A curated list of resources dedicated to Natural Language Processing (NLP) of Cantonese | 粵語 NLP
This is a curated collection of resources for working with Cantonese in natural language processing. It provides links to various Cantonese text datasets (corpora) and software tools for tasks like pronunciation labeling. Researchers, linguists, and computational linguists focusing on the Cantonese language would find this useful for their studies and projects.
No commits in the last 6 months.
Use this if you need to find existing Cantonese datasets or specialized tools for linguistic analysis or building applications that understand and process Cantonese.
Not ideal if you are looking for ready-to-use, off-the-shelf Cantonese NLP models or applications without needing to delve into underlying data or tools.
Stars
92
Forks
4
Language
—
License
CC-BY-4.0
Category
Last pushed
Oct 17, 2021
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/CanCLID/awesome-cantonese-nlp"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
gentaiscool/code-switching-papers
A curated list of research papers and resources on code-switching
RichardLitt/low-resource-languages
Resources for conservation, development, and documentation of low resource (human) languages.
UCREL/pymusas-models
PyMUSAS Models
ksopyla/awesome-nlp-polish
A curated list of resources dedicated to Natural Language Processing (NLP) in polish. Models,...
datanada/Awesome-Korean-NLP
A curated list of resources for NLP (Natural Language Processing) for Korean