mpuig/spacy-lookup

Named Entity Recognition based on dictionaries

54
/ 100
Established

This tool helps data analysts and researchers automatically identify and categorize specific terms or phrases within text documents, like job titles, product names, or scientific concepts. You provide a list of keywords or a dictionary of terms with their categories, and it outputs the original text with these terms highlighted and labeled. It's ideal for anyone who needs to quickly extract predefined information from large volumes of text.

239 stars. No commits in the last 6 months. Available on PyPI.

Use this if you need to reliably find and classify exact words or phrases in text based on a custom list or dictionary.

Not ideal if you need to identify entities that aren't explicitly listed in your dictionary, such as variations, synonyms, or newly emerging terms.

text-analysis information-extraction document-processing data-labeling
Stale 6m
Maintenance 0 / 25
Adoption 10 / 25
Maturity 25 / 25
Community 19 / 25

How are scores calculated?

Stars

239

Forks

38

Language

Python

License

MIT

Last pushed

Mar 03, 2019

Commits (30d)

0

Dependencies

2

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/mpuig/spacy-lookup"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.