htaghizadeh/PersianStemmer-Python

PersianStemmer-Python

/ 100

Established

When analyzing Persian text, this tool helps you normalize words by reducing them to their root or stem form. This process takes various inflected or derived forms of a word (like plurals or conjugated verbs) and returns a consistent base form. It's useful for anyone working with Persian language data, such as researchers in linguistics, digital humanities, or anyone building search engines or text analysis tools for Persian.

No commits in the last 6 months. Available on PyPI.

Use this if you need to prepare Persian text for analysis, search, or comparison by standardizing word forms.

Not ideal if you are working with languages other than Persian or require full lemmatization (reducing words to their dictionary form, which can be more complex than stemming).

Persian-language-processing text-normalization linguistic-analysis information-retrieval digital-humanities

Stale 6m No Dependents

Maintenance 0 / 25

Adoption 8 / 25

Maturity 25 / 25

Community 18 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

BSD-2-Clause

Related tools

hplt-project/sacremoses

Python port of Moses tokenizer, truecaser and normalizer

Blake-Madden/OleanderStemmingLibrary

Porter stemming library (C++)

adbar/simplemma

Simple multilingual lemmatizer for Python, especially useful for speed and efficiency

michmech/lemmatization-lists

Machine-readable lists of lemma-token pairs in 23 languages.

winkjs/wink-porter2-stemmer

Javascript Implementation of Porter Stemmer Algorithm V2 by Dr Martin F Porter

Explore NLP Tools

All categories Trending NLP directory Insights