vj1494/PipelineIE

PipelineIE is a project that contains a pipeline for information extraction (currently triple) from free text and domain specific text (eg. biomedical domain) and also supports custom models making it flexible to support other domains. It takes care of coreference resolution and entity resolution by also allowing to test with different tools.

27
/ 100
Experimental

This tool helps researchers and analysts extract structured information from unstructured text, especially in specialized fields like biomedicine. It takes raw text, identifies the true subjects and objects even if complex or referenced by pronouns, and outputs clear subject-verb-object triples. Anyone who needs to convert natural language documents into actionable data for analysis or databases would find this useful.

No commits in the last 6 months.

Use this if you need to precisely identify key relationships and entities within large volumes of domain-specific text, like scientific papers or reports.

Not ideal if your primary goal is general sentiment analysis or simply keyword extraction without needing detailed relational understanding.

biomedical-research text-analysis information-extraction data-mining knowledge-graph-building
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 5 / 25
Maturity 16 / 25
Community 6 / 25

How are scores calculated?

Stars

12

Forks

1

Language

Python

License

MIT

Last pushed

Mar 15, 2021

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/vj1494/PipelineIE"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.