sdadas/polish-nlp-resources
Pre-trained models and language resources for Natural Language Processing in Polish
This project offers a collection of pre-trained models and linguistic resources specifically for the Polish language. It takes raw Polish text and outputs numerical representations of words or entire texts, allowing for deeper analysis. Anyone working with Polish text data, such as researchers, linguists, or data scientists, would find this useful for tasks like understanding text similarity, classification, or machine translation.
369 stars.
Use this if you need to analyze, process, or understand large volumes of Polish text data for research or application development.
Not ideal if your primary focus is on languages other than Polish, or if you require models trained on very specific, niche Polish corpora not covered by general sources like Wikipedia and books.
Stars
369
Forks
33
Language
—
License
LGPL-3.0
Category
Last pushed
Mar 02, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/sdadas/polish-nlp-resources"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
nltk/nltk
NLTK Source
explosion/spaCy
💫 Industrial-strength Natural Language Processing (NLP) in Python
undertheseanlp/underthesea
Underthesea - Vietnamese NLP Toolkit
stanfordnlp/stanza
Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many...
flairNLP/flair
A very simple framework for state-of-the-art Natural Language Processing (NLP)