mawo-ru/mawo-razdel
Продвинутая токенизация для русского языка с SynTagRus паттернами
This tool helps you accurately break down Russian texts into individual sentences and words. It takes raw Russian text, like news articles or literary works, and outputs clearly separated sentences and tokens (words, numbers, punctuation). Data analysts, linguists, or anyone working with Russian text for analysis or processing will find this useful.
Used by 1 other package. Available on PyPI.
Use this if you need to precisely segment Russian texts, especially those containing complex abbreviations, initials, direct speech, or decimal numbers, for natural language processing or text analysis.
Not ideal if you are working with languages other than Russian or if your text analysis tasks do not require advanced, highly accurate sentence and word segmentation.
Stars
8
Forks
—
Language
Python
License
—
Category
Last pushed
Nov 11, 2025
Commits (30d)
0
Reverse dependents
1
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/mawo-ru/mawo-razdel"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
nert-nlp/streusle
STREUSLE: a corpus with comprehensive lexical semantic annotation (multiword expressions, supersenses)
bretttolbert/verbecc
Verbe Complete Conjugator (verbecc) supports Catalan, Spanish, French, Italian, Portuguese and...
natasha/yargy
Rule-based facts extraction for Russian language
bjascob/LemmInflect
A python module for English lemmatization and inflection.
google-research/turkish-morphology
A two-level morphological analyzer for Turkish.