KGCP/MEL-TNNT

Metadata Extractor & Loader (MEL) ■ The NLP-NER Toolkit (TNNT)

/ 100

Experimental

This project helps researchers and knowledge managers automatically extract key information from a wide variety of documents. You input individual files like PDFs, Word documents, or emails, and it outputs a structured JSON file containing metadata, raw text, and identified entities like people, organizations, and dates. It's designed for anyone needing to quickly summarize or categorize content from large collections of diverse documents.

No commits in the last 6 months.

Use this if you need to systematically pull out specific facts and details from an assortment of files to create a searchable and organized knowledge base.

Not ideal if you only need simple text extraction or if your documents are all in a single, consistent format that doesn't require deep entity recognition.

knowledge-management document-analysis information-extraction research-data-processing content-categorization

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 7 / 25

Maturity 16 / 25

Community 4 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

—

Higher-rated alternatives

MantisAI/nervaluate

Full named-entity (i.e., not tag/token) evaluation metrics based on SemEval’13

dice-group/gerbil

GERBIL - General Entity annotatoR Benchmark

bltlab/seqscore

SeqScore: Scoring for named entity recognition and other sequence labeling tasks

syuoni/eznlp

Easy Natural Language Processing

LHNCBC/metamaplite

A near real-time named-entity recognizer

Explore NLP Tools

All categories Trending NLP directory Insights