kampsy/gwizo
Simple Go implementation of the Porter Stemmer algorithm with powerful features.
This project helps Go developers reduce words to their stems. A stem is a truncated form that need not be a dictionary word: the Porter algorithm, for example, reduces "abilities" to "abil". The library takes individual words or a list of words as input and outputs their stemmed versions. Developers building applications that process and analyze text will find this useful for tasks like search indexing or content categorization.
No commits in the last 6 months.
Use this if you are a Go developer building a search engine, text analysis tool, or any application that needs to normalize words to their root form to improve data consistency and retrieval.
Not ideal if you need the grammatical context or meaning of a word: stemming only strips suffixes by rule and does not recover true linguistic roots, which is the job of a lemmatizer.
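To make the suffix-stripping idea concrete, here is a minimal sketch of step 1a of Porter's published algorithm, which handles plural endings. It is an illustration only and does not use gwizo's API; the function name `step1a` is hypothetical, and the sketch assumes lowercase ASCII input.

```go
package main

import (
	"fmt"
	"strings"
)

// step1a applies the four plural rules from step 1a of the Porter algorithm.
// Hypothetical helper for illustration; it is not part of gwizo.
// Rules are checked longest suffix first, and only the first match applies.
func step1a(word string) string {
	switch {
	case strings.HasSuffix(word, "sses"):
		return strings.TrimSuffix(word, "es") // SSES -> SS
	case strings.HasSuffix(word, "ies"):
		return strings.TrimSuffix(word, "es") // IES -> I
	case strings.HasSuffix(word, "ss"):
		return word // SS -> SS (unchanged)
	case strings.HasSuffix(word, "s"):
		return strings.TrimSuffix(word, "s") // S -> (removed)
	}
	return word
}

func main() {
	for _, w := range []string{"caresses", "ponies", "caress", "cats"} {
		fmt.Printf("%s -> %s\n", w, step1a(w))
	}
}
```

The result "poni" for "ponies" shows why stems are not always real words. A full Porter stemmer applies several further steps after this one.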
Stars: 27
Forks: 2
Language: Go
License: —
Category:
Last pushed: Jun 03, 2021
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/kampsy/gwizo"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
hplt-project/sacremoses: Python port of Moses tokenizer, truecaser and normalizer
Blake-Madden/OleanderStemmingLibrary: Porter stemming library (C++)
adbar/simplemma: Simple multilingual lemmatizer for Python, especially useful for speed and efficiency
htaghizadeh/PersianStemmer-Python: PersianStemmer-Python
michmech/lemmatization-lists: Machine-readable lists of lemma-token pairs in 23 languages.