howl-anderson/seq2annotation

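A general-purpose sequence labeling library built on TensorFlow and PaddlePaddle (currently includes BiLSTM+CRF, Stacked-BiLSTM+CRF, and IDCNN+CRF, with more algorithms being added) for sequence labeling tasks such as Chinese word segmentation (tokenization), part-of-speech (POS) tagging, and named entity recognition (NER).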

53 / 100 (Established)

This tool helps data scientists and NLP practitioners automatically extract structured information from Chinese text. You provide raw Chinese text, and it identifies and labels specific elements like individual words, their grammatical roles, and named entities (like people, places, or organizations). This is useful for anyone building systems that need to understand and process large volumes of Chinese language data.
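Sequence labelers of this kind typically emit one tag per character, and a small post-processing step turns those tags into labeled spans. The following is a minimal sketch of that step, assuming the common BIO tagging scheme (`B-` begins an entity, `I-` continues it, `O` is outside any entity). The function name `bio_to_spans` and the `PER`/`LOC` labels are hypothetical illustrations, not part of seq2annotation's API.

```python
from typing import List, Optional, Tuple


def bio_to_spans(chars: List[str], tags: List[str]) -> List[Tuple[str, str]]:
    """Group per-character BIO tags into (text, label) spans.

    Hypothetical helper for illustration; not taken from seq2annotation.
    """
    spans: List[Tuple[str, str]] = []
    current: List[str] = []
    label: Optional[str] = None

    def flush() -> None:
        # Close the span being built, if any.
        nonlocal current, label
        if current and label is not None:
            spans.append(("".join(current), label))
        current, label = [], None

    for ch, tag in zip(chars, tags):
        if tag.startswith("B-"):
            flush()
            current, label = [ch], tag[2:]
        elif tag.startswith("I-") and label == tag[2:]:
            current.append(ch)
        else:
            # "O", or an "I-" tag that does not continue the open span:
            # close the open span and skip this character.
            flush()
    flush()
    return spans


if __name__ == "__main__":
    text = "王小明在北京工作"
    tags = ["B-PER", "I-PER", "I-PER", "O", "B-LOC", "I-LOC", "O", "O"]
    print(bio_to_spans(list(text), tags))
```

The same grouping applies to word segmentation and POS tagging if their tags are encoded with a begin/inside scheme; only the label set changes.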

No commits in the last 6 months. Available on PyPI.

Use this if you need to perform advanced text analysis tasks like Chinese word segmentation, part-of-speech tagging, or named entity recognition on your Chinese text data.

Not ideal if you are looking for a pre-built, production-ready API for these tasks, as this is a library for building your own solutions.

Chinese-NLP text-analysis information-extraction natural-language-processing text-segmentation
Stale 6m
Maintenance 0 / 25
Adoption 9 / 25
Maturity 25 / 25
Community 19 / 25

Stars: 86
Forks: 21
Language: Python
License: Apache-2.0
Last pushed: Dec 08, 2022
Commits (30d): 0
Dependencies: 14

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/howl-anderson/seq2annotation"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
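The same request can be made from Python using only the standard library. This is a rough sketch: the URL pattern is copied from the curl command above, while the helper names `build_quality_url` and `fetch_quality` are hypothetical, and the assumption that the endpoint returns a JSON body is not confirmed by this page.

```python
import json
from urllib.parse import quote
from urllib.request import urlopen

# Base path taken from the curl example on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, repo: str) -> str:
    """Build the endpoint URL for a repo given as 'owner/name'."""
    # Keep the slash between owner and name; escape anything else unusual.
    return f"{BASE_URL}/{quote(category)}/{quote(repo, safe='/')}"


def fetch_quality(category: str, repo: str, timeout: float = 10.0):
    """Fetch and decode the response, assuming the body is JSON."""
    with urlopen(build_quality_url(category, repo), timeout=timeout) as resp:
        return json.load(resp)


if __name__ == "__main__":
    # Performs a real network request and counts against the daily limit.
    data = fetch_quality("nlp", "howl-anderson/seq2annotation")
    print(json.dumps(data, indent=2, ensure_ascii=False))
```

Because the response fields are not documented here, the sketch prints the decoded payload as-is instead of reading specific keys.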