Alexzsh/FDDC

Named Entity Recognition & Relation Extraction 实体命名识别与关系分类

/ 100

Emerging

This project helps operations managers, legal professionals, and financial analysts efficiently extract key details from unstructured HTML contract documents. It takes raw HTML files and associated training data as input, then identifies and categorizes specific entities like 'hetong' (contract), 'jiafang' (Party A), 'xiangmu' (project), and 'yifang' (Party B) within the text. The output provides these identified entities along with their exact location in the original document.

No commits in the last 6 months.

Use this if you need to automatically pull out specific types of information and relationships from Chinese contract-like HTML documents, such as identifying the parties involved or the project names.

Not ideal if your documents are in a language other than Chinese, are primarily tables, images, or highly complex, non-standard layouts, or if you need to process scanned paper documents.

contract-analysis legal-document-processing information-extraction Chinese-documents data-structuring

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 7 / 25

Maturity 16 / 25

Community 16 / 25

How are scores calculated?

Stars

Forks

Language

HTML

License

Apache-2.0

Higher-rated alternatives

davidsbatista/BREDS

"Bootstrapping Relationship Extractors with Distributional Semantics" (Batista et al., 2015) in...

davidsbatista/Snowball

Implementation with some extensions of the paper "Snowball: Extracting Relations from Large...

nicolay-r/AREkit

Document level Attitude and Relation Extraction toolkit (AREkit) for sampling and processing...

plkmo/BERT-Relation-Extraction

PyTorch implementation for "Matching the Blanks: Distributional Similarity for Relation Learning" paper

thunlp/FewRel

A Large-Scale Few-Shot Relation Extraction Dataset

Explore NLP Tools

All categories Trending NLP directory Insights