aaaton/golem
A lemmatizer implemented in Go
This tool helps analyze text by reducing the inflected forms of a word to a single base form (its lemma), for example "aligning" to "align" or the Swedish "sprungit" to "springa". You input words or text in a supported language, and it returns the standardized base form of each word. This is useful for anyone who works with textual data across several languages and needs consistent analysis, such as linguists, data scientists, or content managers.
No commits in the last 6 months.
Use this if you need to process text in English, Swedish, French, Spanish, Italian, or German and standardize words to their root form for analysis or search.
Not ideal if you need a stemming tool that simply chops off word endings, or if you require lemmatization for a language not currently supported.
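The difference between dictionary-based lemmatization and suffix-chopping stemming can be sketched in a few lines of Go. This is a toy illustration only: the `Lemmatizer` type, `NewLemmatizer`, `naiveStem`, and the three-entry word table are hypothetical names and data made up for this example. They are not golem's API or its dictionaries, which ship with the library itself.

```go
package main

import (
	"fmt"
	"strings"
)

// Lemmatizer maps inflected word forms to base forms via a lookup table.
// Hypothetical type for illustration; this is not golem's actual API.
type Lemmatizer struct {
	forms map[string]string
}

// NewLemmatizer builds a toy lemmatizer from a form -> lemma table.
func NewLemmatizer(forms map[string]string) *Lemmatizer {
	return &Lemmatizer{forms: forms}
}

// Lemma returns the base form of word, or the lowercased word if the
// form is not in the table.
func (l *Lemmatizer) Lemma(word string) string {
	key := strings.ToLower(word)
	if lemma, ok := l.forms[key]; ok {
		return lemma
	}
	return key
}

// naiveStem strips a few common English suffixes without consulting any
// dictionary, which is roughly what a simple stemmer does.
func naiveStem(word string) string {
	w := strings.ToLower(word)
	for _, suffix := range []string{"ing", "ed", "s"} {
		if strings.HasSuffix(w, suffix) && len(w) > len(suffix)+2 {
			return strings.TrimSuffix(w, suffix)
		}
	}
	return w
}

func main() {
	// Assumed sample data: two pairs from the description above plus one
	// irregular English plural.
	lem := NewLemmatizer(map[string]string{
		"aligning": "align",
		"sprungit": "springa",
		"mice":     "mouse",
	})
	for _, w := range []string{"Aligning", "sprungit", "mice"} {
		fmt.Printf("%-9s lemma=%-8s stem=%s\n", w, lem.Lemma(w), naiveStem(w))
	}
}
```

In this sketch the stemmer handles the regular form "aligning" but leaves irregular forms such as "mice" and "sprungit" unchanged, whereas the table lookup resolves them. That is the trade-off the note above describes: a lemmatizer depends on per-language data, while a stemmer needs none but only covers regular endings.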
Stars: 93
Forks: 20
Language: Go
License: MIT
Category
Last pushed: May 09, 2025
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/aaaton/golem"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
ikawaha/kagome-dict
Dictionary Library for Kagome v2
habeanf/yap
Yet Another (natural language) Parser
clipperhouse/uax29
A tokenizer based on Unicode text segmentation (UAX #29), for Go. Split graphemes, words, sentences.
jdkato/prose
A Golang library for text processing, including tokenization, part-of-speech tagging, and...
abadojack/whatlanggo
Natural language detection library for Go