xiamx/lemma
A Morphological Parser (Analyser) / Lemmatizer written in Elixir.
This tool helps linguists, researchers, or anyone working with text to reduce different forms of a word to its common base. For example, 'organizes' and 'organizing' both become 'organize'. You input text with varied word forms, and it outputs the text with words transformed to their base (lemma) form.
Use this if you need to process text for linguistic analysis or search applications where understanding the root form of words is important, especially for non-production, exploratory work.
Not ideal if you need a high-performance, CPU-efficient, or memory-efficient solution for large-scale production text processing.
Stars
50
Forks
2
Language
Elixir
License
Apache-2.0
Category
Last pushed
Jan 22, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/xiamx/lemma"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
hplt-project/sacremoses
Python port of Moses tokenizer, truecaser and normalizer
Blake-Madden/OleanderStemmingLibrary
Porter stemming library (C++)
adbar/simplemma
Simple multilingual lemmatizer for Python, especially useful for speed and efficiency
htaghizadeh/PersianStemmer-Python
PersianStemmer-Python
michmech/lemmatization-lists
Machine-readable lists of lemma-token pairs in 23 languages.