recogito/tei-standoffconverter-js
Convert between TEI/XML and plaintext without losing markup context.
This tool helps researchers and digital humanists enrich their TEI/XML documents using modern text analysis software. It takes a TEI/XML document, converts it to plain text for analysis, and then maps any new annotations (like identified entities or key terms) back into the original TEI/XML structure without losing existing markup. It's designed for scholars, librarians, and archivists working with historical texts or complex document structures.
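The round trip described above (strip the markup, analyze the plain text, then map annotation offsets back into the XML) can be illustrated with a minimal sketch. This is not the tei-standoffconverter-js API: `toStandoff`, `annotate`, and the `Standoff` type are hypothetical names invented for illustration. The sketch also assumes simple input with no entities, comments, or CDATA, and it rejects annotations that would cross an element boundary.

```typescript
// Hypothetical illustration only: these names are NOT the library's API.
interface Standoff {
  text: string;         // plain text with all tags removed
  xmlOffsets: number[]; // xmlOffsets[i] is the index in the XML of text[i]
}

// Strip tags while recording where each plain-text character came from.
// Assumption: no entities, comments, CDATA, or ">" inside attribute values.
function toStandoff(xml: string): Standoff {
  let text = "";
  const xmlOffsets: number[] = [];
  let inTag = false;
  for (let i = 0; i < xml.length; i++) {
    const ch = xml.charAt(i);
    if (ch === "<") { inTag = true; continue; }
    if (ch === ">" && inTag) { inTag = false; continue; }
    if (!inTag) {
      text += ch;
      xmlOffsets.push(i);
    }
  }
  return { text, xmlOffsets };
}

// Wrap the plain-text range [start, end) in a new element inside the XML.
function annotate(
  xml: string,
  standoff: Standoff,
  start: number,
  end: number,
  tag: string
): string {
  if (start < 0 || end > standoff.text.length || start >= end) {
    throw new RangeError("annotation range is outside the plain text");
  }
  const xmlStart = standoff.xmlOffsets[start];
  const xmlEnd = standoff.xmlOffsets[end - 1] + 1;
  const covered = xml.slice(xmlStart, xmlEnd);
  // Refuse ranges that cross a tag; wrapping them naively could break nesting.
  if (covered.includes("<")) {
    throw new Error("annotation crosses an element boundary");
  }
  return `${xml.slice(0, xmlStart)}<${tag}>${covered}</${tag}>${xml.slice(xmlEnd)}`;
}

// Usage: an external tool finds "Ada Lovelace" in the plain text,
// and the offsets are mapped back into the original markup.
const sampleXml = "<p>Letter from <hi>Ada Lovelace</hi> to Babbage.</p>";
const sample = toStandoff(sampleXml);
const hit = sample.text.indexOf("Ada Lovelace");
console.log(annotate(sampleXml, sample, hit, hit + "Ada Lovelace".length, "persName"));
```

A real converter would work on a parsed DOM rather than raw characters, and would need a strategy for annotations that overlap existing elements; the offset table is the core idea.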
No commits in the last 6 months.
Use this if you need to apply text analysis or named entity recognition tools to TEI/XML documents and seamlessly integrate the results back into the original XML markup.
Not ideal if your documents are not in TEI/XML format or if you primarily work with plain text that doesn't require maintaining complex markup structures.
Stars: 9
Forks: —
Language: TypeScript
License: MIT
Category:
Last pushed: Jun 25, 2025
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/recogito/tei-standoffconverter-js"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
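The same request can be made from TypeScript. The sketch below assumes a runtime with a global `fetch` (for example Node 18+ or a browser). The helper names `qualityUrl` and `fetchQuality` are hypothetical, and reading the `nlp` path segment as a category parameter is a guess based on the URL above. The response format is not documented here, so the result is typed as `unknown`.

```typescript
// Base path taken from the curl example; everything else is an assumption.
const API_BASE = "https://pt-edge.onrender.com/api/v1/quality";

// Build the endpoint URL for one repository.
// Assumption: the path is <category>/<owner>/<repo>, as in the curl example.
function qualityUrl(category: string, owner: string, repo: string): string {
  const path = [category, owner, repo]
    .map((segment) => encodeURIComponent(segment))
    .join("/");
  return `${API_BASE}/${path}`;
}

// Fetch and parse the JSON body; the shape is unknown, so callers must
// validate it before use.
async function fetchQuality(
  category: string,
  owner: string,
  repo: string
): Promise<unknown> {
  const response = await fetch(qualityUrl(category, owner, repo));
  if (!response.ok) {
    throw new Error(`Request failed with status ${response.status}`);
  }
  return response.json();
}
```

Calling `fetchQuality("nlp", "recogito", "tei-standoffconverter-js")` would request the same URL as the curl command; unauthenticated calls count against the 100 requests/day limit.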
Higher-rated alternatives
spencermountain/compromise: modest natural-language processing
textlint/textlint: textlint is the pluggable linter for natural language text.
ChristianMurphy/classify-poetry: recognize type poetry in a given text excerpt
Planeshifter/text-miner: text mining utilities for Node.js
aholstenson/ecolect-js: Natural language handling for commands and intents