xiamx/lemma

A Morphological Parser (Analyser) / Lemmatizer written in Elixir.

Emerging

This tool helps linguists, researchers, and anyone else working with text reduce the inflected forms of a word to a common base form, its lemma. For example, 'organizes' and 'organizing' both become 'organize'. You supply text containing varied word forms, and it returns the text with each word replaced by its lemma.
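To make the idea concrete, here is a minimal sketch of lemmatization in Python. It is a toy illustration only and does not reflect how xiamx/lemma is implemented or what its API looks like; the exception table, the suffix rules, and the function names `lemmatize_word` and `lemmatize_text` are all hypothetical.

```python
# Toy lemmatizer: exception lookup first, then ordered suffix rules.
# Illustrative only; NOT the implementation or API of xiamx/lemma.

# Hypothetical table of irregular forms that no suffix rule can handle.
IRREGULAR = {"went": "go", "mice": "mouse", "was": "be"}

# Hypothetical (suffix, replacement) pairs, tried in order; far from complete.
SUFFIX_RULES = [
    ("izing", "ize"),
    ("izes", "ize"),
    ("ies", "y"),
    ("ing", ""),
    ("s", ""),
]


def lemmatize_word(word: str) -> str:
    """Return a lowercase base form for a single word."""
    lower = word.lower()
    if lower in IRREGULAR:
        return IRREGULAR[lower]
    for suffix, replacement in SUFFIX_RULES:
        stem_length = len(lower) - len(suffix)
        # Require a stem of at least 3 letters so short words like "is" stay intact.
        if lower.endswith(suffix) and stem_length >= 3:
            return lower[:stem_length] + replacement
    return lower


def lemmatize_text(text: str) -> str:
    """Lemmatize every whitespace-separated word in the text."""
    return " ".join(lemmatize_word(w) for w in text.split())


print(lemmatize_text("She organizes and was organizing"))
```

A real morphological parser covers far more patterns than a handful of suffix rules can, which is why a sketch like this is only suitable for showing the input and output shape of the task.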

Use this if you need to process text for linguistic analysis or search applications where the root form of each word matters, especially in exploratory, non-production work.

Not ideal if you need a high-performance, CPU-efficient, or memory-efficient solution for large-scale production text processing.

linguistics text-analysis natural-language-processing information-retrieval corpus-linguistics

No Package · No Dependents

Maintenance 10 / 25

Adoption 8 / 25

Maturity 16 / 25

Community 5 / 25

Language: Elixir

License: Apache-2.0

Higher-rated alternatives

hplt-project/sacremoses

Python port of Moses tokenizer, truecaser and normalizer

Blake-Madden/OleanderStemmingLibrary

Porter stemming library (C++)

adbar/simplemma

Simple multilingual lemmatizer for Python, especially useful for speed and efficiency

htaghizadeh/PersianStemmer-Python

PersianStemmer-Python

michmech/lemmatization-lists

Machine-readable lists of lemma-token pairs in 23 languages.
