taishan1994/pytorch_triple_extraction

基于pytorch的中文三元组提取（命名实体识别+关系抽取）

/ 100

Emerging

This project helps operations engineers or data scientists extract structured information from unstructured Chinese text, particularly for technical reports like automobile fault descriptions. It takes raw Chinese text and a list of predefined relationships (e.g., '部件故障' - component failure, '性能故障' - performance issue) as input. The output is a list of subject-object-relation triplets, such as ('发动机', '熄火', '部件故障'), making it easier to build knowledge graphs or analyze textual data.

362 stars. No commits in the last 6 months.

Use this if you need to automatically identify entities (like 'engine' or 'failure') and the relationships between them (like 'engine' has a 'component failure' of 'stalling') from large volumes of Chinese text.

Not ideal if your data is not in Chinese or if you primarily need to extract entities without considering their relationships, or if you need a pre-trained model for a highly specialized domain without further training.

knowledge-graph-construction chinese-text-analysis fault-diagnosis text-mining information-extraction

No License Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 8 / 25

Community 18 / 25

How are scores calculated?

Stars

362

Forks

Language

Python

License

—

Higher-rated alternatives

davidsbatista/BREDS

"Bootstrapping Relationship Extractors with Distributional Semantics" (Batista et al., 2015) in...

davidsbatista/Snowball

Implementation with some extensions of the paper "Snowball: Extracting Relations from Large...

nicolay-r/AREkit

Document level Attitude and Relation Extraction toolkit (AREkit) for sampling and processing...

plkmo/BERT-Relation-Extraction

PyTorch implementation for "Matching the Blanks: Distributional Similarity for Relation Learning" paper

thunlp/FewRel

A Large-Scale Few-Shot Relation Extraction Dataset

Explore NLP Tools

All categories Trending NLP directory Insights