prit2596/NLP-Template-Extraction
Template Extraction from unstructured Wikipedia text using NLP techniques.
This project helps you extract structured information from unstructured text, such as Wikipedia articles or news reports. You input raw text, and it outputs pre-defined templates containing key facts like who bought what, who works where, or geographical part-of relationships. This is ideal for researchers, data analysts, or content managers who need to quickly pull specific facts from large volumes of text.
No commits in the last 6 months.
Use this if you need to automatically identify and extract specific types of events or relationships (like purchases, employment, or geographical inclusions) from a collection of text documents.
Not ideal if you need to extract highly nuanced or subjective information, or if your required templates deviate significantly from the 'BUY', 'WORK', and 'PART' structures provided.
Stars
41
Forks
7
Language
Jupyter Notebook
License
—
Category
Last pushed
Jun 23, 2020
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/prit2596/NLP-Template-Extraction"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
ziqizhang/jate
JATE - Just Automatic Term Extraction (in Python)
mcs07/ChemDataExtractor
Automatically extract chemical information from scientific documents
brucewlee/lftk
[BEA @ ACL 2023] General-purpose tool for linguistic features extraction; Tested on readability...
mmmaurer/elfen
A python package to efficiently extract linguistic features for text/NLP datasets
strangetom/ingredient-parser
A tool to parse recipe ingredients into structured data