CLARIN-PL/embeddings
Embeddings: State-of-the-art Text Representations for Natural Language Processing tasks. The initial version of the library focuses on the Polish language.
This library helps researchers and NLP practitioners working with the Polish language to quickly build and evaluate models for tasks like sentiment analysis, named entity recognition, or part-of-speech tagging. You provide your Polish text data, and the library outputs a trained classification or labeling model. It's designed for those who need to process and understand Polish text efficiently.
No commits in the last 6 months.
Use this if you are developing or evaluating Natural Language Processing solutions specifically for the Polish language, such as classifying customer reviews or identifying key entities in text.
Not ideal if your primary focus is on languages other than Polish, as its initial version is tailored for Polish text representations.
Stars: 37
Forks: 3
Language: Python
License: MIT
Last pushed: Dec 03, 2023
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/CLARIN-PL/embeddings"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
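The same request can be made from Python. The sketch below is a minimal illustration using only the standard library. The helper names build_quality_url and fetch_quality are hypothetical, and the response format is not documented on this page, so the code assumes JSON and falls back to the raw text if parsing fails.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality/nlp"


def build_quality_url(owner: str, repo: str) -> str:
    """Build the endpoint URL for an owner/repo pair (hypothetical helper)."""
    return f"{BASE_URL}/{quote(owner, safe='')}/{quote(repo, safe='')}"


def fetch_quality(owner: str, repo: str, timeout: float = 10.0):
    """Fetch the record for a repository (hypothetical helper).

    Assumption: the endpoint returns JSON. If the body is not valid JSON,
    the raw text is returned unchanged instead.
    """
    url = build_quality_url(owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        body = response.read().decode("utf-8")
    try:
        return json.loads(body)
    except json.JSONDecodeError:
        return body


# Example call (performs a network request and counts against the daily limit):
# record = fetch_quality("CLARIN-PL", "embeddings")
```

Keeping URL construction separate from the network call makes it easy to check the address without spending any of the daily request allowance.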
Higher-rated alternatives
dselivanov/text2vec
Fast vectorization, topic modeling, distances and GloVe word embeddings in R.
vzhong/embeddings
Fast, DB Backed pretrained word embeddings for natural language processing.
dccuchile/spanish-word-embeddings
Spanish word embeddings computed with different methods and from different corpora
ncbi-nlp/BioSentVec
BioWordVec & BioSentVec: pre-trained embeddings for biomedical words and sentences
ibrahimsharaf/doc2vec
Long(er) text representation and classification using Doc2Vec embeddings