CogComp/talen
A way to do annotations for NER. TALEN: Tool for Annotation of Low-resource ENtities
This tool helps linguists, researchers, and data scientists quickly label specific words or phrases in large text collections, even in languages they don't speak. You feed it raw text documents, and it helps you highlight and categorize important entities like names, locations, or organizations, saving your progress and outputting annotated text. It's designed for anyone needing to create high-quality labeled datasets for natural language processing.
119 stars. No commits in the last 6 months.
Use this if you need to manually identify and categorize specific entities within a large body of text, especially for languages with limited existing linguistic resources.
Not ideal if you need a fully automated solution for text analysis or if you are looking for a tool to perform general document review without specific entity labeling.
Stars
119
Forks
25
Language
Java
License
—
Category
Last pushed
Jul 12, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/CogComp/talen"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
MantisAI/nervaluate
Full named-entity (i.e., not tag/token) evaluation metrics based on SemEval’13
dice-group/gerbil
GERBIL - General Entity annotatoR Benchmark
bltlab/seqscore
SeqScore: Scoring for named entity recognition and other sequence labeling tasks
syuoni/eznlp
Easy Natural Language Processing
LHNCBC/metamaplite
A near real-time named-entity recognizer