vj1494/PipelineIE

PipelineIE provides a pipeline for information extraction (currently triples) from free text and domain-specific text (e.g. the biomedical domain). It also supports custom models, which makes it adaptable to other domains. It handles coreference resolution and entity resolution, and lets you try different tools for these steps.

/ 100

Experimental

This tool helps researchers and analysts extract structured information from unstructured text, especially in specialized fields like biomedicine. It takes raw text, identifies the actual subjects and objects even when they are complex or referred to by pronouns, and outputs clear subject-verb-object triples. Anyone who needs to convert natural language documents into structured data for analysis or databases would find this useful.
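To make the idea of pronoun-aware triple extraction concrete, here is a toy sketch in plain Python. It is not PipelineIE's API or method: the function names, the fixed verb list, and the rule "a pronoun subject refers to the previous sentence's subject" are all assumptions chosen for illustration, and a real pipeline would rely on trained coreference and extraction models instead.

```python
import re
from typing import List, Tuple

Triple = Tuple[str, str, str]

# Hypothetical, hand-picked vocab for this sketch only.
PRONOUNS = {"it", "he", "she", "they"}
VERBS = {"inhibits", "reduces", "causes", "treats", "activates"}


def split_sentences(text: str) -> List[str]:
    """Split on sentence-ending punctuation (very naive)."""
    return [s.strip() for s in re.split(r"[.!?]+", text) if s.strip()]


def extract_triples(text: str) -> List[Triple]:
    """Return (subject, verb, object) triples, resolving pronoun subjects.

    Assumption: a pronoun subject refers to the subject of the most
    recent sentence that had a non-pronoun subject.
    """
    triples: List[Triple] = []
    last_subject = None
    for sentence in split_sentences(text):
        tokens = sentence.split()
        # Find the first token that is a known verb.
        verb_index = next(
            (i for i, tok in enumerate(tokens) if tok.lower() in VERBS), None
        )
        # Skip sentences with no verb, no subject, or no object.
        if verb_index is None or verb_index == 0 or verb_index == len(tokens) - 1:
            continue
        subject = " ".join(tokens[:verb_index])
        obj = " ".join(tokens[verb_index + 1:])
        if subject.lower() in PRONOUNS:
            if last_subject is None:
                continue  # nothing to resolve the pronoun against
            subject = last_subject
        else:
            last_subject = subject
        triples.append((subject, tokens[verb_index].lower(), obj))
    return triples


if __name__ == "__main__":
    print(extract_triples("Aspirin inhibits COX-1. It reduces inflammation."))
```

In this sketch the second sentence's "It" is replaced by "Aspirin" before the triple is emitted, which is the kind of resolution the description above refers to.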

No commits in the last 6 months.

Use this if you need to precisely identify key relationships and entities within large volumes of domain-specific text, like scientific papers or reports.

Not ideal if your primary goal is general sentiment analysis or simple keyword extraction that does not require detailed relational understanding.

biomedical-research text-analysis information-extraction data-mining knowledge-graph-building

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 5 / 25

Maturity 16 / 25

Community 6 / 25

Language: Python

License: MIT

Higher-rated alternatives

zjunlp/OpenUE

[EMNLP 2020] OpenUE: An Open Toolkit of Universal Extraction from Text

OpenSextant/Xponents

Geographic Place, Date/time, and Pattern entity extraction toolkit along with text extraction...

BaptisteBlouin/EventExtractionPapers

A list of NLP resources focused on event extraction task

philipperemy/stanford-openie-python

Stanford Open Information Extraction made simple!

uma-pi1/minie

An open information extraction system that provides compact extractions
