recogito/tei-standoffconverter-js

Convert between TEI/XML and plaintext without losing markup context.

22
/ 100
Experimental

This tool helps researchers and digital humanists enrich their TEI/XML documents using modern text analysis software. It takes a TEI/XML document, converts it to plain text for analysis, and then maps any new annotations (like identified entities or key terms) back into the original TEI/XML structure without losing existing markup. It's designed for scholars, librarians, and archivists working with historical texts or complex document structures.

No commits in the last 6 months.

Use this if you need to apply text analysis or named entity recognition tools to TEI/XML documents and seamlessly integrate the results back into the original XML markup.

Not ideal if your documents are not in TEI/XML format or if you primarily work with plain text that doesn't require maintaining complex markup structures.

digital-humanities philology text-encoding-initiative historical-research archive-management
Stale 6m No Package No Dependents
Maintenance 2 / 25
Adoption 5 / 25
Maturity 15 / 25
Community 0 / 25

How are scores calculated?

Stars

9

Forks

Language

TypeScript

License

MIT

Last pushed

Jun 25, 2025

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/recogito/tei-standoffconverter-js"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.