yaleimeng/NER_corpus_chinese
NER(命名实体识别)中文语料,一站式获取
This project helps data scientists, NLP researchers, and machine learning engineers working with Chinese text easily find and use pre-annotated Chinese text datasets for Named Entity Recognition (NER). It provides access to various established Chinese NER corpora, allowing you to train models to identify specific types of entities like person names, locations, and organization names within text.
130 stars. No commits in the last 6 months.
Use this if you need to quickly access and utilize existing, publicly available Chinese NER datasets for training or evaluating your own entity recognition models.
Not ideal if you require highly specialized, domain-specific NER datasets that are not publicly available or if you need to create entirely new annotations from scratch.
Stars
130
Forks
35
Language
—
License
MIT
Category
Last pushed
Sep 10, 2019
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/yaleimeng/NER_corpus_chinese"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
hellohaptik/chatbot_ner
chatbot_ner: Named Entity Recognition for chatbots.
openeventdata/mordecai
Full text geoparsing as a Python library
Rostlab/nalaf
NLP framework in python for entity recognition and relationship extraction
mpuig/spacy-lookup
Named Entity Recognition based on dictionaries
NorskRegnesentral/skweak
skweak: A software toolkit for weak supervision applied to NLP tasks