Alir3z4/python-stop-words
Get list of common stop words in various languages in Python
This tool helps developers clean and prepare text data for analysis by providing lists of common 'stop words' (like 'the', 'is', 'at') in over 34 languages. It takes a language code or name as input and outputs a list of words to exclude from your text. This is useful for anyone building applications that analyze or process text, such as data scientists, NLP engineers, or backend developers.
159 stars. Used by 7 other packages. Available on PyPI.
Use this if you need to efficiently remove common, less meaningful words from text across multiple languages to improve the focus of your text analysis.
Not ideal if you need highly specialized or domain-specific stop word lists that aren't covered by general language usage.
Stars
159
Forks
29
Language
Python
License
BSD-3-Clause
Category
Last pushed
Nov 03, 2025
Commits (30d)
0
Reverse dependents
7
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/Alir3z4/python-stop-words"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
hklemp/dotnet-stop-words
Get list of common stop words in various languages in dotnet
igorbrigadir/stopwords
Default English stopword lists from many different sources
skupriienko/Ukrainian-Stopwords
the list of ~2000 ukrainian stopwords (with numbers)
stdlib-js/datasets-savoy-stopwords-fr
A list of French stop words.
eklem/stopword-trainer
A module for creating stopword lists for any language, based on a set of documents.