malllabiisc/RESIDE
EMNLP 2018: RESIDE: Improving Distantly-Supervised Neural Relation Extraction using Side Information
This project helps knowledge engineers and data analysts automatically identify specific relationships between entities mentioned in large text corpora, like news articles or academic papers. It takes raw text and existing knowledge graph information as input, then outputs extracted relationships. The primary users are researchers or practitioners working with information extraction from unstructured text to build or enrich knowledge bases.
250 stars. No commits in the last 6 months.
Use this if you need to extract specific types of relationships (e.g., 'located in', 'employs') between entities from a large collection of sentences, especially when you can leverage existing side information like entity types or relation aliases to improve accuracy.
Not ideal if your goal is general text summarization, sentiment analysis, or if you do not have access to any form of 'side information' to aid the relation extraction process.
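To make the idea of "side information" concrete, here is a minimal toy sketch in Python. It is not RESIDE's neural model and does not use its code; it only illustrates two ideas named above: sentences get labelled by matching entity pairs against an existing knowledge base (distant supervision), and entity types narrow down which relations are plausible for a pair. The knowledge base, entity types, relation signatures, and function names are all hypothetical.

```python
from typing import Dict, List, Optional, Tuple

# Hypothetical toy knowledge base: (head entity, tail entity) -> relation.
KB: Dict[Tuple[str, str], str] = {
    ("Paris", "France"): "located_in",
    ("Google", "Sundar Pichai"): "employs",
}

# Hypothetical side information: a coarse type for each entity ...
ENTITY_TYPES: Dict[str, str] = {
    "Paris": "LOCATION",
    "France": "LOCATION",
    "Google": "ORGANIZATION",
    "Sundar Pichai": "PERSON",
}

# ... and the (head type, tail type) signature each relation expects.
RELATION_SIGNATURES: Dict[str, Tuple[str, str]] = {
    "located_in": ("LOCATION", "LOCATION"),
    "employs": ("ORGANIZATION", "PERSON"),
}


def distant_labels(sentences: List[str]) -> List[Tuple[str, str, str, str]]:
    """Label a sentence with a KB relation when both entities of the pair appear in it."""
    labeled = []
    for sentence in sentences:
        for (head, tail), relation in KB.items():
            if head in sentence and tail in sentence:
                labeled.append((sentence, head, tail, relation))
    return labeled


def allowed_relations(head: str, tail: str) -> List[str]:
    """Return the relations whose type signature matches the types of the entity pair."""
    pair: Tuple[Optional[str], Optional[str]] = (
        ENTITY_TYPES.get(head),
        ENTITY_TYPES.get(tail),
    )
    return [rel for rel, sig in RELATION_SIGNATURES.items() if sig == pair]


if __name__ == "__main__":
    corpus = ["Paris is the capital of France.", "Google announced a new phone."]
    print(distant_labels(corpus))
    print(allowed_relations("Google", "Sundar Pichai"))
```

The substring matching here is deliberately naive. A real pipeline would rely on entity linking, and a trained model would score relations rather than look them up.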
Stars: 250
Forks: 48
Language: CSS
License: Apache-2.0
Category:
Last pushed: Mar 24, 2023
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/malllabiisc/RESIDE"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
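For scripted use, the same request can be sketched in Python with only the standard library. This sketch assumes two things the page does not state: that the endpoint returns JSON, and that the URL follows a category/owner/repo pattern (inferred from the single example URL above). The function names are hypothetical.

```python
import json
import urllib.request

# Base path taken from the curl example; the path pattern is an assumption.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_url(category: str, owner: str, repo: str) -> str:
    """Build the quality endpoint URL for one repository."""
    return f"{BASE_URL}/{category}/{owner}/{repo}"


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the quality record; assumes the response body is JSON."""
    url = build_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


if __name__ == "__main__":
    # Performs a real network request and counts against the daily limit.
    print(fetch_quality("nlp", "malllabiisc", "RESIDE"))
```

Because unauthenticated use is capped at 100 requests/day, caching responses locally is a sensible precaution when polling many repositories.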
Higher-rated alternatives
facebookresearch/stopes
A library for preparing data for machine translation research (monolingual preprocessing,...
Droidtown/ArticutAPI
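API of Articut, a Chinese word segmenter (with semantic part-of-speech tagging): word segmentation is the foundation of Chinese information processing. Articut uses no machine learning and needs no data model; relying only on the grammar rules of modern vernacular Chinese, it can achieve...
rkcosmos/deepcut
A Thai word tokenization library using Deep Neural Network
fukuball/jieba-php
"Jieba" (Chinese for "to stutter") Chinese text segmentation: aims to be the best PHP component for Chinese word segmentation...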
pytorch/text
Models, data loaders and abstractions for language processing, powered by PyTorch