AstraZeneca/VecNER

A library of tools for dictionary-based Named Entity Recognition (NER), based on word vector representations to expand dictionary terms.

27
/ 100
Experimental

This project helps domain experts like scientists, marketers, or analysts automatically find specific terms and concepts in large amounts of text. You provide a list of terms relevant to your field, and it identifies not only those exact terms but also similar, related phrases in your documents. The output is your text with the identified key terms and concepts highlighted and categorized.

No commits in the last 6 months.

Use this if you need to extract specific, domain-related entities from text, especially when your initial list of terms is limited or you want to discover related concepts within your specialized data.

Not ideal if you need a pre-trained general-purpose entity recognition system and do not have a specialized lexicon or a large domain-specific text corpus to leverage.

text-analysis biomedical-research customer-feedback-analysis document-intelligence market-research
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 7 / 25
Maturity 16 / 25
Community 4 / 25

How are scores calculated?

Stars

26

Forks

1

Language

Python

License

Apache-2.0

Last pushed

Jul 25, 2023

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/AstraZeneca/VecNER"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.