AstraZeneca/VecNER
A library of tools for dictionary-based Named Entity Recognition (NER), based on word vector representations to expand dictionary terms.
This project helps domain experts like scientists, marketers, or analysts automatically find specific terms and concepts in large amounts of text. You provide a list of terms relevant to your field, and it identifies not only those exact terms but also similar, related phrases in your documents. The output is your text with the identified key terms and concepts highlighted and categorized.
No commits in the last 6 months.
Use this if you need to extract specific, domain-related entities from text, especially when your initial list of terms is limited or you want to discover related concepts within your specialized data.
Not ideal if you need a pre-trained general-purpose entity recognition system and do not have a specialized lexicon or a large domain-specific text corpus to leverage.
Stars
26
Forks
1
Language
Python
License
Apache-2.0
Category
Last pushed
Jul 25, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/AstraZeneca/VecNER"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
MantisAI/nervaluate
Full named-entity (i.e., not tag/token) evaluation metrics based on SemEval’13
dice-group/gerbil
GERBIL - General Entity annotatoR Benchmark
bltlab/seqscore
SeqScore: Scoring for named entity recognition and other sequence labeling tasks
syuoni/eznlp
Easy Natural Language Processing
LHNCBC/metamaplite
A near real-time named-entity recognizer