KGCP/MEL-TNNT
Metadata Extractor & Loader (MEL) ■ The NLP-NER Toolkit (TNNT)
This project helps researchers and knowledge managers automatically extract key information from a wide variety of documents. You input individual files like PDFs, Word documents, or emails, and it outputs a structured JSON file containing metadata, raw text, and identified entities like people, organizations, and dates. It's designed for anyone needing to quickly summarize or categorize content from large collections of diverse documents.
No commits in the last 6 months.
Use this if you need to systematically pull out specific facts and details from an assortment of files to create a searchable and organized knowledge base.
Not ideal if you only need simple text extraction or if your documents are all in a single, consistent format that doesn't require deep entity recognition.
Stars
25
Forks
1
Language
Python
License
—
Category
Last pushed
Mar 13, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/KGCP/MEL-TNNT"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
MantisAI/nervaluate
Full named-entity (i.e., not tag/token) evaluation metrics based on SemEval’13
dice-group/gerbil
GERBIL - General Entity annotatoR Benchmark
bltlab/seqscore
SeqScore: Scoring for named entity recognition and other sequence labeling tasks
syuoni/eznlp
Easy Natural Language Processing
LHNCBC/metamaplite
A near real-time named-entity recognizer