yumeng5/RoSTER

[EMNLP 2021] Distantly-Supervised Named Entity Recognition with Noise-Robust Learning and Language Model Augmented Self-Training

35
/ 100
Emerging

This project helps natural language processing researchers train models to identify specific entities like names or locations in text, even when the initial training data has errors or 'noise.' You provide raw text documents and a corresponding set of 'distant labels' (automatically generated or imperfect labels). The output is a highly accurate named entity recognition (NER) model, ready to be deployed or further refined.

No commits in the last 6 months.

Use this if you need to build a robust Named Entity Recognition model for your text data but only have access to distantly supervised or noisy training labels.

Not ideal if you have a perfectly clean, manually annotated dataset for your NER task, as its specialized noise-robust features would be overkill.

Natural Language Processing Named Entity Recognition Information Extraction Text Annotation Machine Learning Research
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 8 / 25
Maturity 16 / 25
Community 11 / 25

How are scores calculated?

Stars

65

Forks

7

Language

Python

License

Apache-2.0

Last pushed

Nov 12, 2021

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/yumeng5/RoSTER"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.