mikahama/uralicNLP
An NLP library for Uralic languages such as Finnish, Skolt Sami, and Moksha. It also supports some non-Uralic languages, including Spanish, French, Arabic, Swedish, Norwegian, Russian, and English. LLMs, FSTs and more!
UralicNLP helps linguists, researchers, and language enthusiasts analyze and generate text in a wide range of languages, including many less-resourced Uralic languages. You can input words or phrases to get detailed morphological analyses, find base forms (lemmas), or generate specific word forms from a base and desired grammatical features. The output provides structured linguistic information, making it easier to understand word structure and meaning.
Available on PyPI.
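As a rough illustration of the analysis, lemmatization, and generation workflow described above, here is a minimal sketch. It assumes the package was installed with `pip install uralicNLP` and that the Finnish models were downloaded once (for example with `uralicApi.download("fin")`). The helpers `build_generation_query` and `demo` are hypothetical names introduced here for illustration, and the tags `N`, `Sg`, `Par` follow the style used in the project's own examples; the tags a given language model accepts may differ.

```python
# Sketch only: assumes `pip install uralicNLP` and that the Finnish models
# have already been downloaded, e.g. via uralicApi.download("fin").


def build_generation_query(lemma, tags):
    """Join a lemma and grammatical tags into a '+'-separated query string.

    Hypothetical helper (not part of uralicNLP): it only formats the string
    that is later passed to uralicApi.generate.
    """
    return "+".join([lemma, *tags])


def demo(word="voita", language="fin"):
    """Run analysis, lemmatization, and generation for one word.

    Returns None when uralicNLP is not installed. Raises the library's own
    error if the models for `language` have not been downloaded.
    """
    try:
        from uralicNLP import uralicApi
    except ImportError:
        return None

    # Generate the partitive singular of the noun "käsi" (assumed tag set).
    query = build_generation_query("käsi", ["N", "Sg", "Par"])
    return {
        "analyses": uralicApi.analyze(word, language),
        "lemmas": uralicApi.lemmatize(word, language),
        "generated": uralicApi.generate(query, language),
    }
```

Calling `demo()` after the models are in place should return the morphological readings, candidate lemmas, and generated forms in one dictionary; the exact shape of each value depends on the installed version and backend.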
Use this if you need to perform detailed linguistic analysis, lemmatization, or morphological generation for texts in various languages, especially Uralic ones, without building complex models from scratch.
Not ideal if your primary goal is general-purpose, high-volume text analysis like sentiment analysis or topic modeling for common languages, as more specialized tools might offer broader feature sets for those tasks.
Stars: 93
Forks: 7
Language: Python
License: Apache-2.0
Category:
Last pushed: Mar 12, 2026
Commits (30d): 0
Dependencies: 8
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/mikahama/uralicNLP"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
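The same request can be made from Python using only the standard library. This is a minimal sketch, not an official client: the function names `quality_url` and `fetch_quality` are hypothetical, the `category/owner/repo` path layout is inferred from the single URL shown above, and the response is returned as raw text because its format is not documented here.

```python
from urllib.parse import quote
from urllib.request import urlopen

# Base path taken from the curl example; everything after it is assumed to
# follow the pattern <category>/<owner>/<repo>.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category, owner, repo):
    """Build the endpoint URL for one repository (hypothetical helper)."""
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return "/".join([BASE_URL, *parts])


def fetch_quality(category, owner, repo, timeout=10.0):
    """Fetch the endpoint and return the response body as text.

    No API key is sent, so the keyless daily request limit applies.
    """
    url = quality_url(category, owner, repo)
    with urlopen(url, timeout=timeout) as response:
        return response.read().decode("utf-8")
```

For example, `fetch_quality("nlp", "mikahama", "uralicNLP")` requests the same URL as the curl command. With the keyless limit in mind, it is worth caching the result rather than calling it in a loop.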
Related tools
SkyworkAI/Skywork
Skywork series models are pre-trained on 3.2TB of high-quality multilingual (mainly Chinese and...
gia-uh/lingo
A Python library for context engineering.
shamspias/lexsublm-lite
A laptop‑friendly toolkit for context‑aware single‑word paraphrasing and lexical‑substitution...
AragonerUA/SampoNLP
A corpus-free toolkit for morphological lexicon creation and tokenizer evaluation using...
jiangnanboy/llm_corpus_quality
Cleaning and quality evaluation of Chinese corpora for large-model pre-training