INK-USC/PLE

Label Noise Reduction in Entity Typing (KDD'16)

/ 100

Emerging

This project helps identify and categorize entities (like people, organizations, or locations) within large volumes of text, even when the initial labels are inconsistent or inaccurate. It takes a text corpus with detected entities and rough labels as input and outputs the same text with refined, more accurate entity types. Researchers and data analysts working with text data who need highly precise entity recognition for downstream tasks would find this valuable.

No commits in the last 6 months.

Use this if you need to clean up noisy, automatically generated labels for entities in your text data to improve the accuracy of fine-grained entity typing.

Not ideal if you need an end-to-end tool that takes raw text and outputs typed entities directly, as entity detection and initial labeling are assumed to be pre-done.

text-analysis natural-language-processing entity-recognition information-extraction data-cleaning

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 8 / 25

Maturity 16 / 25

Community 18 / 25

How are scores calculated?

Stars

Forks

Language

C++

License

GPL-3.0

Higher-rated alternatives

davidsbatista/BREDS

"Bootstrapping Relationship Extractors with Distributional Semantics" (Batista et al., 2015) in...

davidsbatista/Snowball

Implementation with some extensions of the paper "Snowball: Extracting Relations from Large...

nicolay-r/AREkit

Document level Attitude and Relation Extraction toolkit (AREkit) for sampling and processing...

plkmo/BERT-Relation-Extraction

PyTorch implementation for "Matching the Blanks: Distributional Similarity for Relation Learning" paper

thunlp/FewRel

A Large-Scale Few-Shot Relation Extraction Dataset

Explore NLP Tools

All categories Trending NLP directory Insights