natasha/yargy
Rule-based facts extraction for Russian language
This project helps you automatically extract specific pieces of information from Russian text, like names, job titles, or other key facts. You provide text written in Russian and define the patterns of information you're looking for, and it returns structured data with those facts clearly identified. It's designed for data analysts, researchers, or anyone needing to pull specific data points from large volumes of Russian documents.
330 stars. Used by 1 other package. No commits in the last 6 months. Available on PyPI.
Use this if you need to reliably find and extract specific, structured information (like people's names and positions) from Russian language texts.
Not ideal if you're working with languages other than Russian or if you need to understand the general sentiment or overall topic of a text rather than discrete facts.
Stars
330
Forks
43
Language
Python
License
MIT
Category
Last pushed
Jul 24, 2023
Commits (30d)
0
Dependencies
1
Reverse dependents
1
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/natasha/yargy"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
nert-nlp/streusle
STREUSLE: a corpus with comprehensive lexical semantic annotation (multiword expressions, supersenses)
bretttolbert/verbecc
Verbe Complete Conjugator (verbecc) supports Catalan, Spanish, French, Italian, Portuguese and...
google-research/turkish-morphology
A two-level morphological analyzer for Turkish.
bjascob/LemmInflect
A python module for English lemmatization and inflection.
IlyaGusev/rnnmorph
Morphological analyzer for Russian and English languages based on neural networks and...