daviddongkc/DocOIE

Released Code for ACL 21 paper: DocOIE A Document-level Context-Aware Dataset for OpenIE

28
/ 100
Experimental

This project provides a specialized dataset and model for Open Information Extraction (OpenIE) specifically designed for document-level understanding. It helps developers and researchers working with large text documents, particularly in healthcare and transportation, to extract factual information more accurately by considering the broader context of a document, rather than just isolated sentences. It takes raw text documents as input and outputs structured facts or relationships.

No commits in the last 6 months.

Use this if you are a developer or researcher building systems that need to extract factual information from full documents, especially in the healthcare or transportation domains, where sentence context is crucial.

Not ideal if you are looking for a simple, off-the-shelf tool for general information extraction from short texts, or if you are not comfortable working with machine learning models and datasets.

information-extraction natural-language-processing text-analytics machine-learning-research healthcare-nlp
No License Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 6 / 25
Maturity 8 / 25
Community 14 / 25

How are scores calculated?

Stars

15

Forks

3

Language

Python

License

Last pushed

Nov 25, 2022

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/daviddongkc/DocOIE"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.