aaaton/golem

A lemmatizer implemented in Go

/ 100

Emerging

This tool helps analyze text by reducing different forms of a word to its base form, like changing "aligning" to "align" or "sprungit" to "springa". You input words or text in a supported language, and it returns the standardized base form of each word. This is useful for anyone working with textual data across various languages who needs to ensure consistent analysis, like linguists, data scientists, or content managers.

No commits in the last 6 months.

Use this if you need to process text in English, Swedish, French, Spanish, Italian, or German and standardize words to their root form for analysis or search.

Not ideal if you need a stemming tool that simply chops off word endings, or if you require lemmatization for a language not currently supported.

text-analysis natural-language-processing linguistics information-retrieval data-preparation

Stale 6m No Package No Dependents

Maintenance 2 / 25

Adoption 9 / 25

Maturity 16 / 25

Community 19 / 25

How are scores calculated?

Stars

Forks

Language

License

MIT

Higher-rated alternatives

ikawaha/kagome-dict

Dictionary Library for Kagome v2

habeanf/yap

Yet Another (natural language) Parser

clipperhouse/uax29

A tokenizer based on Unicode text segmentation (UAX #29), for Go. Split graphemes, words, sentences.

jdkato/prose

:book: A Golang library for text processing, including tokenization, part-of-speech tagging, and...

abadojack/whatlanggo

Natural language detection library for Go

Explore NLP Tools

All categories Trending NLP directory Insights