writecrow/lemmatizer
A PHP library for getting a lemma from a given word, and getting a list of words that map to a lemma.
This tool helps content creators, linguists, or data analysts standardize words to their base form (lemma) or find all variations of a base word. You input a word like "running" and get its lemma "run," or input "run" to get a list like "running, ran, runs." This is useful for tasks like text analysis, search optimization, or building glossaries.
No commits in the last 6 months.
Use this if you need to quickly find the root form of a word or get a list of all related inflections without considering their grammatical role in a sentence.
Not ideal if your task requires differentiating word meanings based on their part of speech (e.g., "leaves" as a verb vs. "leaves" as a noun).
Stars
13
Forks
6
Language
PHP
License
MIT
Category
Last pushed
Nov 08, 2019
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/writecrow/lemmatizer"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
hplt-project/sacremoses
Python port of Moses tokenizer, truecaser and normalizer
Blake-Madden/OleanderStemmingLibrary
Porter stemming library (C++)
adbar/simplemma
Simple multilingual lemmatizer for Python, especially useful for speed and efficiency
htaghizadeh/PersianStemmer-Python
PersianStemmer-Python
michmech/lemmatization-lists
Machine-readable lists of lemma-token pairs in 23 languages.