damzaky/sastrawijs
Indonesian language stemmer. Javascript port of PHP Sastrawi project.
This tool helps you process Indonesian text by transforming words with prefixes and suffixes into their root forms. You provide Indonesian sentences, and it returns a list of the base words, making it easier to analyze the core meaning. It's ideal for linguists, data analysts, or researchers working with large volumes of Indonesian language data.
No commits in the last 6 months. Available on npm.
Use this if you need to standardize Indonesian words to their basic forms for tasks like search, text analysis, or natural language processing.
Not ideal if you're working with languages other than Indonesian or if you need more advanced linguistic analysis beyond basic stemming.
Stars
38
Forks
10
Language
TypeScript
License
MIT
Category
Last pushed
Mar 11, 2024
Commits (30d)
0
Dependencies
1
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/damzaky/sastrawijs"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
hplt-project/sacremoses
Python port of Moses tokenizer, truecaser and normalizer
Blake-Madden/OleanderStemmingLibrary
Porter stemming library (C++)
adbar/simplemma
Simple multilingual lemmatizer for Python, especially useful for speed and efficiency
htaghizadeh/PersianStemmer-Python
PersianStemmer-Python
michmech/lemmatization-lists
Machine-readable lists of lemma-token pairs in 23 languages.