Alir3z4/python-stop-words

Get list of common stop words in various languages in Python

65
/ 100
Established

This tool helps developers clean and prepare text data for analysis by providing lists of common 'stop words' (like 'the', 'is', 'at') in over 34 languages. It takes a language code or name as input and outputs a list of words to exclude from your text. This is useful for anyone building applications that analyze or process text, such as data scientists, NLP engineers, or backend developers.

159 stars. Used by 7 other packages. Available on PyPI.

Use this if you need to efficiently remove common, less meaningful words from text across multiple languages to improve the focus of your text analysis.

Not ideal if you need highly specialized or domain-specific stop word lists that aren't covered by general language usage.

text-analysis natural-language-processing data-preprocessing information-retrieval multilingual-data
Maintenance 6 / 25
Adoption 15 / 25
Maturity 25 / 25
Community 19 / 25

How are scores calculated?

Stars

159

Forks

29

Language

Python

License

BSD-3-Clause

Last pushed

Nov 03, 2025

Commits (30d)

0

Reverse dependents

7

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/Alir3z4/python-stop-words"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.