FinNLP/lemmatizer

📦 English word lemmatizer

/ 100

Emerging

Need to process English text for consistent analysis? This tool helps you transform various forms of a word (like "running," "ran," "runs") into its base form ("run"). This is useful for anyone working with textual data who needs to consolidate words for better search, comparison, or quantitative analysis.

17 stars and 3,733 monthly downloads. No commits in the last 6 months. Available on npm.

Use this if you need to standardize English words in text data to their root form, for example, when preparing text for sentiment analysis, topic modeling, or information retrieval.

Not ideal if you need to analyze the grammatical role or tense of words, as lemmatization focuses purely on the base lexical form.

text-analysis natural-language-processing data-cleaning information-retrieval linguistics

Stale 6m

Maintenance 0 / 25

Adoption 14 / 25

Maturity 25 / 25

Community 5 / 25

How are scores calculated?

Stars

Forks

Language

TypeScript

License

MIT

Compare

lemmatizer and wink-lemmatizer

Higher-rated alternatives

hplt-project/sacremoses

Python port of Moses tokenizer, truecaser and normalizer

Blake-Madden/OleanderStemmingLibrary

Porter stemming library (C++)

adbar/simplemma

Simple multilingual lemmatizer for Python, especially useful for speed and efficiency

htaghizadeh/PersianStemmer-Python

PersianStemmer-Python

michmech/lemmatization-lists

Machine-readable lists of lemma-token pairs in 23 languages.

Explore NLP Tools

All categories Trending NLP directory Insights