CogComp/talen

A way to do annotations for NER. TALEN: Tool for Annotation of Low-resource ENtities

48
/ 100
Emerging

This tool helps linguists, researchers, and data scientists quickly label specific words or phrases in large text collections, even in languages they don't speak. You feed it raw text documents, and it helps you highlight and categorize important entities like names, locations, or organizations, saving your progress and outputting annotated text. It's designed for anyone needing to create high-quality labeled datasets for natural language processing.

119 stars. No commits in the last 6 months.

Use this if you need to manually identify and categorize specific entities within a large body of text, especially for languages with limited existing linguistic resources.

Not ideal if you need a fully automated solution for text analysis or if you are looking for a tool to perform general document review without specific entity labeling.

linguistics text-annotation natural-language-processing data-labeling entity-recognition
Stale 6m No Package No Dependents
Maintenance 2 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 20 / 25

How are scores calculated?

Stars

119

Forks

25

Language

Java

License

Last pushed

Jul 12, 2025

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/CogComp/talen"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.