AlexKly/Detailed-NER-Dataset-RU

Labeled Russian text token-by-token for training models for NER task based samples got from parsing different resources and generated by ChatGPT.

26
/ 100
Experimental

This dataset provides detailed, labeled Russian text, helping developers create or enhance Natural Language Processing (NLP) models. It takes raw Russian text as input and produces text annotated with specific entity types like locations (cities, countries, streets) and personal names (first, middle, last). NLP engineers and machine learning practitioners building Russian language applications would find this useful.

No commits in the last 6 months.

Use this if you need a high-quality, fine-grained dataset to train or improve models for extracting detailed entities from Russian text.

Not ideal if you only need general entity recognition (e.g., just 'PERSON' or 'LOCATION' without sub-types) or if you are working with languages other than Russian.

NLP Machine Learning Data Annotation Russian Language Processing Information Extraction
No License Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 5 / 25
Maturity 8 / 25
Community 13 / 25

How are scores calculated?

Stars

10

Forks

2

Language

Python

License

Last pushed

Jun 20, 2023

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/AlexKly/Detailed-NER-Dataset-RU"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.