Alexzsh/FDDC
Named Entity Recognition & Relation Extraction 实体命名识别与关系分类
This project helps operations managers, legal professionals, and financial analysts efficiently extract key details from unstructured HTML contract documents. It takes raw HTML files and associated training data as input, then identifies and categorizes specific entities like 'hetong' (contract), 'jiafang' (Party A), 'xiangmu' (project), and 'yifang' (Party B) within the text. The output provides these identified entities along with their exact location in the original document.
No commits in the last 6 months.
Use this if you need to automatically pull out specific types of information and relationships from Chinese contract-like HTML documents, such as identifying the parties involved or the project names.
Not ideal if your documents are in a language other than Chinese, are primarily tables, images, or highly complex, non-standard layouts, or if you need to process scanned paper documents.
Stars
32
Forks
7
Language
HTML
License
Apache-2.0
Category
Last pushed
Jan 06, 2022
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/Alexzsh/FDDC"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
davidsbatista/BREDS
"Bootstrapping Relationship Extractors with Distributional Semantics" (Batista et al., 2015) in...
davidsbatista/Snowball
Implementation with some extensions of the paper "Snowball: Extracting Relations from Large...
nicolay-r/AREkit
Document level Attitude and Relation Extraction toolkit (AREkit) for sampling and processing...
plkmo/BERT-Relation-Extraction
PyTorch implementation for "Matching the Blanks: Distributional Similarity for Relation Learning" paper
thunlp/FewRel
A Large-Scale Few-Shot Relation Extraction Dataset