jiesutd/NCRFpp

NCRF++, a Neural Sequence Labeling Toolkit. Easy to use for any sequence labeling task (e.g. NER, POS tagging, segmentation). It includes character LSTM/CNN, word LSTM/CNN, and softmax/CRF components.

Established

This project helps you automatically categorize words and phrases within text, like identifying all person names or locations in a document, or tagging parts of speech. You provide text data with examples of how words should be labeled, and it produces a system that can accurately tag new, unseen text. It's ideal for data scientists or researchers who need to perform highly accurate text analysis for tasks like information extraction or text structuring.
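To make "text data with examples of how words should be labeled" concrete, here is a minimal sketch of token-level labels in the widely used BIO scheme, with one `token label` pair per line and a blank line between sentences. The sample text, the column layout, and the helper names `parse_column_data` and `extract_spans` are assumptions for illustration; they are not taken from the NCRF++ codebase or its documented input format.

```python
from typing import List, Optional, Tuple

# Hypothetical sample: one "token label" pair per line, blank line between sentences.
SAMPLE = """\
Barack B-PER
Obama I-PER
visited O
Paris B-LOC
. O

She O
works O
at O
Acme B-ORG
Corp I-ORG
"""

Sentence = List[Tuple[str, str]]


def parse_column_data(text: str) -> List[Sentence]:
    """Split blank-line-separated sentences into lists of (token, label) pairs."""
    sentences: List[Sentence] = []
    current: Sentence = []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            # A blank line closes the current sentence, if any.
            if current:
                sentences.append(current)
                current = []
            continue
        # The label is the last whitespace-separated field on the line.
        token, label = line.rsplit(maxsplit=1)
        current.append((token, label))
    if current:
        sentences.append(current)
    return sentences


def extract_spans(sentence: Sentence) -> List[Tuple[str, str]]:
    """Collect (entity_type, text) spans from BIO labels.

    An I- label that does not continue the open span is treated as outside.
    """
    spans: List[Tuple[str, str]] = []
    tokens: List[str] = []
    kind: Optional[str] = None
    for token, label in sentence:
        if label.startswith("B-"):
            if tokens:
                spans.append((kind, " ".join(tokens)))
            tokens, kind = [token], label[2:]
        elif label.startswith("I-") and kind == label[2:]:
            tokens.append(token)
        else:
            if tokens:
                spans.append((kind, " ".join(tokens)))
            tokens, kind = [], None
    if tokens:
        spans.append((kind, " ".join(tokens)))
    return spans


if __name__ == "__main__":
    for sent in parse_column_data(SAMPLE):
        print(extract_spans(sent))
```

A sequence labeling model is trained to predict the label column from the token column; span extraction like the above is then how per-token predictions are turned back into named entities.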

1,897 stars. No commits in the last 6 months.

Use this if you need to precisely label sequences of words in text, such as for Named Entity Recognition (NER), part-of-speech tagging, or word segmentation, and want a high-performance solution without writing complex code.

Not ideal if you're looking for a simple keyword extraction tool or a full natural language understanding pipeline that includes components beyond sequence labeling.

Natural Language Processing, Information Extraction, Text Analytics, Named Entity Recognition, Linguistics

Stale 6m, No Package, No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 25 / 25

Stars: 1,897

Forks: 442

Language: Python

License: Apache-2.0

Related tools

facebookresearch/stopes

A library for preparing data for machine translation research (monolingual preprocessing,...

Droidtown/ArticutAPI

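API of Articut, a Chinese word segmenter (with semantic part-of-speech tagging). Word segmentation, also called tokenization, is the foundation of Chinese information processing. Articut uses no machine learning and needs no data model; relying only on the grammar rules of modern vernacular Chinese, it can achieve...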

rkcosmos/deepcut

A Thai word tokenization library using Deep Neural Network

fukuball/jieba-php

"Jieba" Chinese word segmentation: aiming to be the best PHP Chinese word segmentation component. / "Jieba" (Chinese for "to stutter") Chinese text segmentation:...

pytorch/text

Models, data loaders and abstractions for language processing, powered by PyTorch
