adbar/simplemma
Simple multilingual lemmatizer for Python, especially useful for speed and efficiency
This tool helps you convert words in different languages into their base or dictionary form. For example, it can change 'masks' to 'mask' or 'angekündigten' to 'ankündigen' in German. This is useful for researchers, data analysts, or anyone working with text in multiple languages who needs to analyze words consistently, regardless of their grammatical variations.
188 stars. Used by 2 other packages. No commits in the last 6 months. Available on PyPI.
Use this if you need a quick, simple, and efficient way to standardize words to their base forms across many languages without complex linguistic analysis.
Not ideal if you require highly nuanced linguistic analysis that considers parts of speech or precise disambiguation between words with identical forms but different meanings.
Stars
188
Forks
15
Language
Python
License
MIT
Category
Last pushed
Jun 06, 2025
Commits (30d)
0
Reverse dependents
2
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/adbar/simplemma"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
hplt-project/sacremoses
Python port of Moses tokenizer, truecaser and normalizer
Blake-Madden/OleanderStemmingLibrary
Porter stemming library (C++)
htaghizadeh/PersianStemmer-Python
PersianStemmer-Python
michmech/lemmatization-lists
Machine-readable lists of lemma-token pairs in 23 languages.
winkjs/wink-porter2-stemmer
Javascript Implementation of Porter Stemmer Algorithm V2 by Dr Martin F Porter