CogComp/talen

A way to do annotations for NER. TALEN: Tool for Annotation of Low-resource ENtities

/ 100

Emerging

This tool helps linguists, researchers, and data scientists quickly label specific words or phrases in large text collections, even in languages they don't speak. You feed it raw text documents, and it helps you highlight and categorize important entities like names, locations, or organizations, saving your progress and outputting annotated text. It's designed for anyone needing to create high-quality labeled datasets for natural language processing.

119 stars. No commits in the last 6 months.

Use this if you need to manually identify and categorize specific entities within a large body of text, especially for languages with limited existing linguistic resources.

Not ideal if you need a fully automated solution for text analysis or if you are looking for a tool to perform general document review without specific entity labeling.

linguistics text-annotation natural-language-processing data-labeling entity-recognition

Stale 6m No Package No Dependents

Maintenance 2 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 20 / 25

How are scores calculated?

Stars

119

Forks

Language

Java

License

—

Higher-rated alternatives

MantisAI/nervaluate

Full named-entity (i.e., not tag/token) evaluation metrics based on SemEval’13

dice-group/gerbil

GERBIL - General Entity annotatoR Benchmark

bltlab/seqscore

SeqScore: Scoring for named entity recognition and other sequence labeling tasks

syuoni/eznlp

Easy Natural Language Processing

LHNCBC/metamaplite

A near real-time named-entity recognizer

Explore NLP Tools

All categories Trending NLP directory Insights