UKPLab/nessie
Automatically detect errors in annotated corpora.
If you build datasets by hand for machine learning, this tool automatically finds mistakes in your annotated text, tokens, or spans. It takes your existing labeled data and flags potentially incorrect annotations so you can prioritize which instances to review and correct. It is aimed at anyone who creates or manages human-annotated text datasets, such as data scientists, linguists, or quality assurance specialists.
No commits in the last 6 months.
Use this if you need to improve the quality and efficiency of your human data annotation process by automatically flagging potential errors in your datasets.
Not ideal if you are looking for a tool to perform the initial annotation of your data, as this is designed for error detection in already-annotated corpora.
Stars
48
Forks
6
Language
Python
License
MIT
Category
NLP
Last pushed
Sep 08, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/UKPLab/nessie"
Open to everyone: 100 requests/day with no key required. Get a free key for 1,000/day.
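The same endpoint can be called from Python instead of curl. The sketch below is a minimal example assuming the URL pattern shown above (`/api/v1/quality/<category>/<owner>/<repo>`) and a JSON response; the field names in the payload are not documented here, so the code just prints whatever comes back.

```python
# Minimal sketch of calling the quality API from Python.
# The URL pattern is taken from the curl example; the response
# is assumed to be JSON (field names are not specified here).
import json
import urllib.request

API_BASE = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the API URL for a repository's quality data."""
    return f"{API_BASE}/{category}/{owner}/{repo}"


def fetch_quality(category: str, owner: str, repo: str) -> dict:
    """Fetch and decode the JSON payload for one repository."""
    with urllib.request.urlopen(quality_url(category, owner, repo)) as resp:
        return json.load(resp)


print(quality_url("nlp", "UKPLab", "nessie"))
```

Without an API key this stays within the 100 requests/day anonymous limit; a key would typically be passed as a header, but the header name is not documented here, so it is omitted.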
Higher-rated alternatives
chakki-works/seqeval
A Python framework for sequence labeling evaluation (named-entity recognition, POS tagging, etc.)
Hironsan/anago
Bidirectional LSTM-CRF and ELMo for Named-Entity Recognition, Part-of-Speech Tagging and so on.
jbesomi/texthero
Text preprocessing, representation and visualization from zero to hero.
hamelsmu/ktext
Utilities for preprocessing text for deep learning with Keras
asahi417/tner
Language model fine-tuning on NER with an easy interface and cross-domain evaluation. "T-NER: An...