d0rj/RusLit
π A small collection of Russian literature π
This collection provides a ready-to-use set of Russian literary works in plain text format, perfect for anyone conducting text-based research or analysis. You get individual book texts and accompanying metadata (like author and publication year) that can be used directly in your research tools. It's ideal for literary scholars, computational linguists, or data scientists working with historical texts.
No commits in the last 6 months.
Use this if you need a pre-compiled, structured dataset of Russian literature for text analysis, natural language processing, or digital humanities research.
Not ideal if you're looking for critical editions, annotated texts, or a platform for reading literature.
Stars
13
Forks
4
Language
—
License
—
Category
Last pushed
Dec 09, 2022
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/d0rj/RusLit"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
Helsinki-NLP/OpusFilter
OpusFilter - Parallel corpus processing toolkit
natasha/corus
Links to Russian corpora + Python functions for loading and parsing
SergeyShk/ruTS
ΠΠΈΠ±Π»ΠΈΠΎΡΠ΅ΠΊΠ° Π΄Π»Ρ ΠΈΠ·Π²Π»Π΅ΡΠ΅Π½ΠΈΡ ΡΡΠ°ΡΠΈΡΡΠΈΠΊ ΠΈΠ· ΡΠ΅ΠΊΡΡΠΎΠ² Π½Π° ΡΡΡΡΠΊΠΎΠΌ ΡΠ·ΡΠΊΠ΅.
darija-open-dataset/dataset
darija <-> english dataset
omicsNLP/Auto-CORPus
Auto-CORPus pipeline developed by a University of Nottingham and Imperial College London...