FinNLP/lemmatizer
📦 English word lemmatizer
Need to process English text for consistent analysis? This tool reduces the inflected forms of a word (such as "running", "ran", and "runs") to a single base form ("run"). This is useful for anyone working with textual data who needs to consolidate word variants for better search, comparison, or quantitative analysis.
17 stars and 3,733 monthly downloads. No commits in the last 6 months. Available on npm.
Use this if you need to reduce English words in text data to their base (dictionary) form, for example when preparing text for sentiment analysis, topic modeling, or information retrieval.
Not ideal if you need to analyze the grammatical role or tense of words, because lemmatization discards inflection and returns only the base lexical form.
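To make the idea of mapping word forms to a base form concrete, here is a minimal, self-contained sketch of one common approach: look up irregular forms first, then try a few suffix rules and accept a result only if it is a known base form. This is not the package's API or its algorithm. The names `lemmatize`, `IRREGULAR`, `BASE_FORMS`, and `SUFFIX_RULES`, as well as the tiny word lists, are assumptions made up for illustration.

```typescript
// Toy lemmatizer sketch. All word lists and rules here are illustrative
// assumptions, not data or behavior taken from FinNLP/lemmatizer.

// Irregular forms that no suffix rule can recover.
const IRREGULAR = new Map<string, string>([
  ["ran", "run"],
  ["went", "go"],
  ["mice", "mouse"],
  ["was", "be"],
  ["were", "be"],
]);

// Hypothetical vocabulary of base forms, used to validate rule output.
const BASE_FORMS = new Set<string>([
  "run", "walk", "study", "make", "cat", "stop", "go", "mouse", "be",
]);

// Each rule strips a suffix and appends a replacement; rules are tried in order.
const SUFFIX_RULES: Array<[string, string]> = [
  ["ies", "y"], // studies -> study
  ["ing", ""],  // walking -> walk
  ["ing", "e"], // making  -> make
  ["ed", ""],   // walked  -> walk
  ["es", ""],   // goes    -> go
  ["s", ""],    // cats    -> cat
];

function lemmatize(word: string): string {
  const w = word.toLowerCase();
  if (BASE_FORMS.has(w)) return w;

  const irregular = IRREGULAR.get(w);
  if (irregular !== undefined) return irregular;

  for (const [suffix, replacement] of SUFFIX_RULES) {
    if (!w.endsWith(suffix)) continue;
    const stem = w.slice(0, w.length - suffix.length);

    const candidate = stem + replacement;
    if (BASE_FORMS.has(candidate)) return candidate;

    // Undo consonant doubling, e.g. "running" -> "runn" -> "run".
    if (stem.length > 2 && stem[stem.length - 1] === stem[stem.length - 2]) {
      const undoubled = stem.slice(0, -1) + replacement;
      if (BASE_FORMS.has(undoubled)) return undoubled;
    }
  }

  // Unknown words pass through unchanged (lowercased).
  return w;
}

console.log(["running", "ran", "runs"].map(lemmatize)); // [ 'run', 'run', 'run' ]
```

Validating each candidate against a vocabulary is what separates this from plain stemming: a rule never produces a non-word, at the cost of leaving unknown words untouched. A real lemmatizer would rely on a much larger lexicon.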
Stars: 17
Forks: 1
Language: TypeScript
License: MIT
Category:
Last pushed: May 03, 2022
Monthly downloads: 3,733
Commits (30d): 0
Dependencies: 3
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/FinNLP/lemmatizer"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
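The same request can be made from TypeScript. The sketch below assumes a runtime with a global `fetch` (for example Node 18+ or a browser). The helper names `buildQualityUrl` and `fetchQuality` are hypothetical, the category/owner/repo path layout is inferred from the single curl example above, and the response is assumed to be JSON. Its schema is not described on this page, so the result is typed as `unknown`.

```typescript
// Base URL copied from the curl example. The path layout
// (category/owner/repo) is an assumption inferred from that one example.
const API_BASE = "https://pt-edge.onrender.com/api/v1/quality";

// Hypothetical helper: builds the endpoint URL for one repository.
function buildQualityUrl(category: string, owner: string, repo: string): string {
  const parts = [category, owner, repo].map((p) => encodeURIComponent(p));
  return `${API_BASE}/${parts.join("/")}`;
}

// Hypothetical helper: fetches the data and parses it as JSON (assumed format).
// The response schema is not documented here, so it is returned as unknown.
async function fetchQuality(
  category: string,
  owner: string,
  repo: string,
): Promise<unknown> {
  const res = await fetch(buildQualityUrl(category, owner, repo));
  if (!res.ok) {
    // Covers rate limiting and other non-2xx responses.
    throw new Error(`Request failed: ${res.status} ${res.statusText}`);
  }
  return res.json();
}
```

Calling `fetchQuality("nlp", "FinNLP", "lemmatizer")` requests the same URL as the curl command. With the keyless limit of 100 requests/day, it is worth caching results rather than refetching on every run.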
Higher-rated alternatives
hplt-project/sacremoses
Python port of Moses tokenizer, truecaser and normalizer
Blake-Madden/OleanderStemmingLibrary
Porter stemming library (C++)
adbar/simplemma
Simple multilingual lemmatizer for Python, especially useful for speed and efficiency
htaghizadeh/PersianStemmer-Python
PersianStemmer-Python
michmech/lemmatization-lists
Machine-readable lists of lemma-token pairs in 23 languages.