generall/ExtWikilinks

Extended Wikilinks dataset description

14
/ 100
Experimental

This dataset helps natural language processing researchers build more accurate named entity linking systems. It takes raw text sentences and provides enriched information, including part-of-speech tags, lemmas, parse tags, and additional entity links beyond what was originally available. Researchers who are developing and evaluating named entity linking or disambiguation models would use this data.

No commits in the last 6 months.

Use this if you need a large, pre-processed dataset of English sentences with detailed linguistic annotations and extended entity mentions to train or evaluate named entity linking algorithms.

Not ideal if you need a dataset focused on languages other than English or if you require fine-grained annotations for tasks other than named entity linking.

natural-language-processing named-entity-linking linguistic-annotation text-analytics information-extraction
No License Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 6 / 25
Maturity 8 / 25
Community 0 / 25

How are scores calculated?

Stars

15

Forks

Language

Jupyter Notebook

License

Last pushed

Apr 01, 2018

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/generall/ExtWikilinks"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.