davidsbatista/Annotated-Semantic-Relationships-Datasets

A collection of free, public annotated datasets of relationships between entities/nominals (Portuguese and English)

/ 100

Emerging

This collection provides pre-annotated text datasets for training systems that automatically identify how different entities or concepts are related within sentences. It helps researchers and data scientists working with natural language processing to build models that can, for example, determine if two proteins interact or if a drug causes an adverse effect. The datasets consist of raw text inputs with specific relationships between words or phrases already labeled, ready for use in machine learning workflows.
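As an illustration only, here is a minimal sketch of how one relation-annotated sentence might be turned into a training example. It assumes a hypothetical inline-tag format in which `<e1>…</e1>` and `<e2>…</e2>` mark the two related entities and the relation label is stored separately. The tag names, the label string, and the `parse_example` helper are assumptions made for this example, not the documented format of any specific dataset in the collection; check each dataset's own files before relying on it.

```python
import re
from typing import NamedTuple


class RelationExample(NamedTuple):
    text: str     # sentence with the entity tags removed
    entity1: str  # surface form of the first entity
    entity2: str  # surface form of the second entity
    label: str    # relation label attached to the pair


# Hypothetical inline-tag format: <e1>...</e1> and <e2>...</e2> mark the entities.
TAG_RE = re.compile(r"</?e[12]>")
E1_RE = re.compile(r"<e1>(.*?)</e1>")
E2_RE = re.compile(r"<e2>(.*?)</e2>")


def parse_example(tagged_sentence: str, label: str) -> RelationExample:
    """Extract both entity spans and the untagged sentence from one line."""
    e1 = E1_RE.search(tagged_sentence)
    e2 = E2_RE.search(tagged_sentence)
    if e1 is None or e2 is None:
        raise ValueError("sentence must contain both <e1> and <e2> spans")
    plain = TAG_RE.sub("", tagged_sentence)
    return RelationExample(plain, e1.group(1), e2.group(1), label)


# Made-up sentence and label, used only to exercise the helper.
example = parse_example(
    "The <e1>drug</e1> caused a severe <e2>rash</e2> in two patients.",
    "causes-adverse-effect",
)
print(example)
```

Keeping the untagged sentence alongside the two entity strings lets the same record feed either a classifier over the plain text or a model that needs the entity mentions explicitly.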

704 stars. No commits in the last 6 months.

Use this if you need pre-labeled text data to train a model that extracts specific types of relationships between entities from unstructured text, especially in English or Portuguese.

Not ideal if you are looking for raw, unannotated text corpora or datasets for tasks other than semantic relationship extraction.

natural-language-processing, information-extraction, text-mining, computational-linguistics, data-annotation

No License, Stale 6m, No Package, No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 8 / 25

Community 24 / 25


Stars 704

Forks 132

Language —

License —

Higher-rated alternatives

MantisAI/nervaluate

Full named-entity (i.e., not tag/token) evaluation metrics based on SemEval’13

dice-group/gerbil

GERBIL - General Entity annotatoR Benchmark

bltlab/seqscore

SeqScore: Scoring for named entity recognition and other sequence labeling tasks

syuoni/eznlp

Easy Natural Language Processing

LHNCBC/metamaplite

A near real-time named-entity recognizer
