marcusklang/docforia

Semistructured Multilayer Document Model

21
/ 100
Experimental

This tool helps natural language processing (NLP) developers represent and query complex linguistic information within a document. It takes raw text as input and allows you to add layers of annotations like tokens, sentences, and named entities, enabling sophisticated text analysis. It's designed for software engineers building NLP applications who need to manage and query structured data from unstructured text.

No commits in the last 6 months.

Use this if you are developing NLP applications and need a flexible way to store, organize, and query multiple layers of annotations (like words, sentences, or custom tags) on a text document.

Not ideal if you are an end-user simply looking for an out-of-the-box NLP solution, or if your primary need is basic text search without complex linguistic structure.

natural-language-processing text-analysis linguistic-annotation information-extraction data-modeling
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 5 / 25
Maturity 16 / 25
Community 0 / 25

How are scores calculated?

Stars

9

Forks

Language

Java

License

Apache-2.0

Last pushed

Sep 14, 2023

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/marcusklang/docforia"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.