davidsbatista/Annotated-Semantic-Relationships-Datasets
A collection of public and free annotated datasets of relationships between entities/nominals (Portuguese and English)
This collection provides pre-annotated text datasets for training systems that automatically identify how different entities or concepts are related within sentences. It helps researchers and data scientists working with natural language processing to build models that can, for example, determine if two proteins interact or if a drug causes an adverse effect. The datasets consist of raw text inputs with specific relationships between words or phrases already labeled, ready for use in machine learning workflows.
704 stars. No commits in the last 6 months.
Use this if you need pre-labeled text data to train a model that extracts specific types of relationships between entities from unstructured text, especially in English or Portuguese.
Not ideal if you are looking for raw, unannotated text corpora or datasets for tasks other than semantic relationship extraction.
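To make the idea of "raw text with relationships already labeled" concrete, here is a minimal sketch of reading one labeled example. It assumes a SemEval-2010 Task 8 style layout, where the two entities are wrapped in `<e1>`/`<e2>` tags and the label looks like `Cause-Effect(e2,e1)`. That layout is an assumption for illustration; individual datasets in the collection may use other formats, and the sentence below is invented, not taken from any dataset.

```python
import re
from dataclasses import dataclass


@dataclass
class RelationExample:
    text: str       # sentence with the entity tags stripped
    e1: str         # surface form of the first tagged entity
    e2: str         # surface form of the second tagged entity
    relation: str   # relation type, e.g. "Cause-Effect"
    direction: str  # e.g. "(e2,e1)"; empty string when the label has none


# Assumed tag and label conventions (hypothetical, SemEval-2010 Task 8 style).
_TAG = re.compile(r"</?e[12]>")
_LABEL = re.compile(r"^(?P<relation>[^()]+)(?P<direction>\(e[12],e[12]\))?$")


def parse_example(tagged_sentence: str, label: str) -> RelationExample:
    """Split a tagged sentence and its label into structured fields."""
    e1 = re.search(r"<e1>(.*?)</e1>", tagged_sentence)
    e2 = re.search(r"<e2>(.*?)</e2>", tagged_sentence)
    if e1 is None or e2 is None:
        raise ValueError("sentence must contain both <e1> and <e2> spans")
    match = _LABEL.match(label.strip())
    if match is None:
        raise ValueError(f"unrecognized label: {label!r}")
    return RelationExample(
        text=_TAG.sub("", tagged_sentence),
        e1=e1.group(1),
        e2=e2.group(1),
        relation=match.group("relation"),
        direction=match.group("direction") or "",
    )


# Invented sentence, for illustration only.
example = parse_example(
    "The <e1>fire</e1> was caused by a faulty <e2>heater</e2>.",
    "Cause-Effect(e2,e1)",
)
```

Keeping the relation type and its direction in separate fields makes it easy to train either a directed or an undirected classifier from the same parsed records.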
Stars: 704
Forks: 132
Language: not listed
License: not listed
Category: not listed
Last pushed: Jul 07, 2021
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/davidsbatista/Annotated-Semantic-Relationships-Datasets"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
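The same request can be made from Python using only the standard library. This is a sketch under two assumptions that the page does not confirm: that the path follows a category/owner/repo pattern (inferred from the single URL above) and that the endpoint returns JSON. The names `build_url` and `fetch_quality` are hypothetical helpers, not part of any published client.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example; everything after it is assumed
# to follow a <category>/<owner>/<repo> pattern.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL, percent-encoding each path segment."""
    parts = [quote(p, safe="") for p in (category, owner, repo)]
    return f"{BASE_URL}/{'/'.join(parts)}"


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch and decode the response, assuming the body is JSON."""
    request = urllib.request.Request(
        build_url(category, owner, repo),
        headers={"Accept": "application/json"},
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)


# Usage (performs a network call, counted against the daily request limit):
# data = fetch_quality("nlp", "davidsbatista",
#                      "Annotated-Semantic-Relationships-Datasets")
```

Since the unauthenticated limit is 100 requests/day, caching the decoded response locally is a sensible default for repeated runs.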
Higher-rated alternatives
MantisAI/nervaluate
Full named-entity (i.e., not tag/token) evaluation metrics based on SemEval’13
dice-group/gerbil
GERBIL - General Entity annotatoR Benchmark
bltlab/seqscore
SeqScore: Scoring for named entity recognition and other sequence labeling tasks
syuoni/eznlp
Easy Natural Language Processing
LHNCBC/metamaplite
A near real-time named-entity recognizer