htaghizadeh/PersianStemmer-Python
PersianStemmer-Python
When analyzing Persian text, this tool helps you normalize words by reducing them to their root or stem form. This process takes various inflected or derived forms of a word (like plurals or conjugated verbs) and returns a consistent base form. It's useful for anyone working with Persian language data, such as researchers in linguistics, digital humanities, or anyone building search engines or text analysis tools for Persian.
No commits in the last 6 months. Available on PyPI.
Use this if you need to prepare Persian text for analysis, search, or comparison by standardizing word forms.
Not ideal if you are working with languages other than Persian or require full lemmatization (reducing words to their dictionary form, which can be more complex than stemming).
Stars
52
Forks
14
Language
Python
License
BSD-2-Clause
Category
Last pushed
Jul 04, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/htaghizadeh/PersianStemmer-Python"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
hplt-project/sacremoses
Python port of Moses tokenizer, truecaser and normalizer
Blake-Madden/OleanderStemmingLibrary
Porter stemming library (C++)
adbar/simplemma
Simple multilingual lemmatizer for Python, especially useful for speed and efficiency
michmech/lemmatization-lists
Machine-readable lists of lemma-token pairs in 23 languages.
winkjs/wink-porter2-stemmer
Javascript Implementation of Porter Stemmer Algorithm V2 by Dr Martin F Porter