SergeyShk/ruTS
Библиотека для извлечения статистик из текстов на русском языке.
This tool helps analyze Russian texts to understand their structural properties, readability, and vocabulary richness. You input Russian text, and it outputs detailed statistics like word counts, sentence lengths, and various readability scores. This is ideal for linguists, educators, content creators, or researchers who need to quantify aspects of Russian language content.
125 stars. No commits in the last 6 months. Available on PyPI.
Use this if you need to systematically analyze Russian text for its complexity, stylistic features, or lexical diversity.
Not ideal if you are looking for sentiment analysis, topic modeling, or translation services.
Stars
125
Forks
21
Language
Python
License
MIT
Category
Last pushed
Jan 21, 2023
Commits (30d)
0
Dependencies
9
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/SergeyShk/ruTS"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
Helsinki-NLP/OpusFilter
OpusFilter - Parallel corpus processing toolkit
natasha/corus
Links to Russian corpora + Python functions for loading and parsing
darija-open-dataset/dataset
darija <-> english dataset
omicsNLP/Auto-CORPus
Auto-CORPus pipeline developed by a University of Nottingham and Imperial College London...
texttechnologylab/GerParCor
German Parliamentary Corpus (GerParCor)