smileart/lemmingo

Defensive lemmatiser/stemmer written in Go ⊂( ⚆ ϖ⚆)っ


Emerging

This tool reduces words to their base form, either a lemma (the dictionary form) or a stem. It takes an individual word and its grammatical role (such as noun or adjective) and returns the corresponding base form. Data scientists, linguists, and anyone else doing natural language processing can use it to standardize words before analysis.
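To make the word-plus-grammatical-role idea concrete, here is a minimal Go sketch of a dictionary lookup that falls back to a naive stem when the word is unknown. It is not lemmingo's actual API: the `Lemmatiser` type, the `Lemma` method, the `stem` helper, the part-of-speech labels, and the tiny dictionary are all hypothetical names and data chosen for illustration.

```go
package main

import "strings"

// entry is a hypothetical dictionary key: a surface form plus its part of speech.
type entry struct {
	word string
	pos  string
}

// Lemmatiser is an illustrative type, not the type exposed by lemmingo.
type Lemmatiser struct {
	lemmas map[entry]string
}

// NewLemmatiser builds a lemmatiser backed by a tiny hand-written dictionary.
func NewLemmatiser() *Lemmatiser {
	return &Lemmatiser{lemmas: map[entry]string{
		{"better", "adjective"}: "good",
		{"mice", "noun"}:        "mouse",
		{"ran", "verb"}:         "run",
	}}
}

// stem is a deliberately naive suffix stripper used only as a fallback.
// It leaves short words alone so that at least three letters remain.
func stem(word string) string {
	for _, suffix := range []string{"ing", "ed", "es", "s"} {
		if strings.HasSuffix(word, suffix) && len(word) > len(suffix)+2 {
			return strings.TrimSuffix(word, suffix)
		}
	}
	return word
}

// Lemma returns the dictionary lemma for (word, pos) when one is known,
// and otherwise falls back to the naive stem of the normalised word.
func (l *Lemmatiser) Lemma(word, pos string) string {
	w := strings.ToLower(strings.TrimSpace(word))
	if lemma, ok := l.lemmas[entry{w, pos}]; ok {
		return lemma
	}
	return stem(w)
}
```

Called from a `main` function, `NewLemmatiser().Lemma("Mice", "noun")` resolves through the dictionary, while an out-of-vocabulary word such as "walking" goes through the fallback stemmer. Keying the dictionary on both word and part of speech is what lets the same surface form map to different base forms in different grammatical contexts.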

No commits in the last 6 months.

Use this if you need to accurately reduce words to their base forms for text analysis, especially when dealing with potentially out-of-vocabulary words or different grammatical contexts.

Not ideal if you need a full-featured text processing library that handles tokenization, named-entity recognition, or dependency parsing in addition to lemmatization.

natural-language-processing text-analysis linguistics information-retrieval computational-linguistics

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 5 / 25

Maturity 16 / 25

Community 11 / 25

License

MIT

Higher-rated alternatives

ikawaha/kagome-dict

Dictionary Library for Kagome v2

aaaton/golem

A lemmatizer implemented in Go

habeanf/yap

Yet Another (natural language) Parser

clipperhouse/uax29

A tokenizer based on Unicode text segmentation (UAX #29), for Go. Split graphemes, words, sentences.

abadojack/whatlanggo

Natural language detection library for Go
