davidsbatista/Annotated-Semantic-Relationships-Datasets

A collections of public and free annotated datasets of relationships between entities/nominals (Portuguese and English)

42
/ 100
Emerging

This collection provides pre-annotated text datasets for training systems that automatically identify how different entities or concepts are related within sentences. It helps researchers and data scientists working with natural language processing to build models that can, for example, determine if two proteins interact or if a drug causes an adverse effect. The datasets consist of raw text inputs with specific relationships between words or phrases already labeled, ready for use in machine learning workflows.

704 stars. No commits in the last 6 months.

Use this if you need pre-labeled text data to train a model that extracts specific types of relationships between entities from unstructured text, especially in English or Portuguese.

Not ideal if you are looking for raw, unannotated text corpora or datasets for tasks other than semantic relationship extraction.

natural-language-processing information-extraction text-mining computational-linguistics data-annotation
No License Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 8 / 25
Community 24 / 25

How are scores calculated?

Stars

704

Forks

132

Language

License

Last pushed

Jul 07, 2021

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/davidsbatista/Annotated-Semantic-Relationships-Datasets"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.