Yinghao-Li/GnO-IE

Code for "A Simple but Effective Approach to Improve Structured Language Model Output for Information Extraction"

/ 100

Emerging

This project helps researchers and data scientists extract specific pieces of information from unstructured text, like identifying diseases in medical reports or relationships between entities in scientific papers. It takes raw text as input and outputs structured data, such as lists of named entities or identified relationships, which can then be used for further analysis or database population. This tool is ideal for those working with large volumes of domain-specific text who need to automate data extraction tasks.

No commits in the last 6 months.

Use this if you need to reliably extract specific facts or relationships from text documents using large language models, especially when precision and accuracy of structured output are critical.

Not ideal if you're looking for a simple, out-of-the-box application for general text summarization or generation, or if you don't have access to large language models like GPT or Llama.

information-extraction text-analysis natural-language-processing data-structuring research-automation

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 6 / 25

Maturity 16 / 25

Community 10 / 25

How are scores calculated?

Stars

Forks

Language

—

License

Apache-2.0

Higher-rated alternatives

williamliujl/CMExam

A Chinese National Medical Licensing Examination dataset and large languge model benchmarks

zjunlp/IEPile

[ACL 2024] IEPile: A Large-Scale Information Extraction Corpus

StefanHeng/ProgGen

Code for paper "ProgGen: Generating Named Entity Recognition Datasets Step-by-step with...

MaheshJakkala/naamapadam-multilingual-ner

Benchmarking NER on Naamapadam across 7 Indic languages. EDA + model training for...

yaoyiran/BLI-Reading-List

A 2024 Reading List for Bilingual Lexicon Induction (BLI) / Word Translation. Frequently Updated.

Explore NLP Tools

All categories Trending NLP directory Insights