rafalposwiata/pl-mteb
PL-MTEB: Polish Massive Text Embedding Benchmark
This project helps machine learning engineers and researchers evaluate how well different language models understand and process Polish text. It takes in a pre-trained Polish text embedding model and outputs a comprehensive performance score across various tasks like text classification, clustering, and semantic search, allowing users to compare models efficiently. It's designed for those who build or deploy NLP solutions for the Polish language.
Use this if you need to rigorously compare the effectiveness of different text embedding models for Polish language applications.
Not ideal if you are working with languages other than Polish or if you need a tool for training new text embedding models from scratch.
Stars
9
Forks
1
Language
Python
License
—
Category
Last pushed
Dec 18, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/embeddings/rafalposwiata/pl-mteb"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Featured in
Higher-rated alternatives
embeddings-benchmark/mteb
MTEB: Massive Text Embedding Benchmark
harmonydata/harmony
The Harmony Python library: a research tool for psychologists to harmonise data and...
yannvgn/laserembeddings
LASER multilingual sentence embeddings as a pip package
embeddings-benchmark/results
Data for the MTEB leaderboard
Hironsan/awesome-embedding-models
A curated list of awesome embedding models tutorials, projects and communities.