skupriienko/Ukrainian-Stopwords
the list of ~2000 ukrainian stopwords (with numbers)
This project provides a comprehensive list of nearly 2,000 common Ukrainian words, including numbers, that are typically filtered out during text analysis. It helps you prepare Ukrainian text for tasks like sentiment analysis, topic modeling, or information retrieval by removing words that don't carry significant meaning. Anyone working with Ukrainian text data, such as linguists, social media analysts, or researchers, would find this useful.
No commits in the last 6 months.
Use this if you need to clean and pre-process Ukrainian text to focus on the most important words for analysis.
Not ideal if your analysis requires retaining all words, including common connecting words and numbers, for specific linguistic research.
Stars
66
Forks
18
Language
Python
License
CC-BY-SA-4.0
Category
Last pushed
May 20, 2021
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/skupriienko/Ukrainian-Stopwords"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
Alir3z4/python-stop-words
Get list of common stop words in various languages in Python
hklemp/dotnet-stop-words
Get list of common stop words in various languages in dotnet
igorbrigadir/stopwords
Default English stopword lists from many different sources
stdlib-js/datasets-savoy-stopwords-fr
A list of French stop words.
eklem/stopword-trainer
A module for creating stopword lists for any language, based on a set of documents.