smileart/lemmingo
Defensive lemmatiser/stemmer written in Go ⊂( ⚆ ϖ⚆)っ
This tool helps analyze text by reducing words to their base form, whether that's a lemma (the dictionary form) or a stem. It takes individual words and their grammatical role (like noun or adjective) and provides the corrected base form. Data scientists, linguists, or anyone performing natural language processing on text can use this to standardize words for better analysis.
No commits in the last 6 months.
Use this if you need to accurately reduce words to their base forms for text analysis, especially when dealing with potentially out-of-vocabulary words or different grammatical contexts.
Not ideal if you need a full-featured text processing library that handles tokenization, named-entity recognition, or dependency parsing in addition to lemmatization.
Stars
13
Forks
2
Language
Go
License
MIT
Category
Last pushed
May 22, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/smileart/lemmingo"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
ikawaha/kagome-dict
Dictionary Library for Kagome v2
aaaton/golem
A lemmatizer implemented in Go
habeanf/yap
Yet Another (natural language) Parser
clipperhouse/uax29
A tokenizer based on Unicode text segmentation (UAX #29), for Go. Split graphemes, words, sentences.
abadojack/whatlanggo
Natural language detection library for Go