xiamx/gen_fst
Elixir module that implements a generic finite state transducer with customizable rules expressed in a DSL.
This tool helps language specialists process and understand text by automatically breaking down words into their core components. You input a list of words, and it outputs their base forms and grammatical markers. It's ideal for linguists, natural language researchers, or anyone needing to analyze word structures systematically.
Use this if you need to translate or relate strings based on defined rules, such as transforming inflected words into their root forms and identifying their grammatical suffixes.
Not ideal if you need a pre-trained, ready-to-use natural language processing tool for complex tasks like sentiment analysis or machine translation without defining explicit rules.
Stars
9
Forks
1
Language
Elixir
License
Apache-2.0
Category
Last pushed
Jan 22, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/xiamx/gen_fst"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
hplt-project/sacremoses
Python port of Moses tokenizer, truecaser and normalizer
Blake-Madden/OleanderStemmingLibrary
Porter stemming library (C++)
adbar/simplemma
Simple multilingual lemmatizer for Python, especially useful for speed and efficiency
htaghizadeh/PersianStemmer-Python
PersianStemmer-Python
michmech/lemmatization-lists
Machine-readable lists of lemma-token pairs in 23 languages.