rafalposwiata/pl-mteb

PL-MTEB: Polish Massive Text Embedding Benchmark

Score: 27 / 100 (Experimental)

This project helps machine learning engineers and researchers evaluate how well text embedding models represent Polish text. It takes a pre-trained text embedding model and reports its performance across tasks such as text classification, clustering, and semantic search, so that different models can be compared on the same footing. It is aimed at those who build or deploy NLP solutions for the Polish language.

Use this if you need to rigorously compare the effectiveness of different text embedding models for Polish language applications.

Not ideal if you are working with languages other than Polish or if you need a tool for training new text embedding models from scratch.
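To make the idea of scoring an embedding model on a task concrete, here is a minimal, self-contained sketch of a semantic-search style check. It is not PL-MTEB's own code: `toy_embed` is a hypothetical stand-in for a real model, the example sentences are made up, and top-1 accuracy is a deliberately simplified metric chosen for illustration.

```python
import math
from typing import Callable, List, Sequence

Vector = Sequence[float]


def cosine(a: Vector, b: Vector) -> float:
    """Cosine similarity; returns 0.0 if either vector has zero length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def retrieval_accuracy(
    embed: Callable[[str], Vector],
    queries: List[str],
    documents: List[str],
    relevant: List[int],
) -> float:
    """Fraction of queries whose most similar document is the relevant one."""
    doc_vecs = [embed(doc) for doc in documents]
    hits = 0
    for query, gold in zip(queries, relevant):
        q_vec = embed(query)
        # Index of the document with the highest cosine similarity.
        best = max(range(len(documents)), key=lambda i: cosine(q_vec, doc_vecs[i]))
        hits += int(best == gold)
    return hits / len(queries)


def toy_embed(text: str) -> List[float]:
    """Hypothetical placeholder model: a letter-count vector, not a real embedding."""
    alphabet = "aąbcćdeęfghijklłmnńoóprsśtuwyzźż"
    lowered = text.lower()
    return [float(lowered.count(ch)) for ch in alphabet]


# Made-up Polish examples; relevant[i] is the index of the right document for queries[i].
queries = ["kot śpi", "pada deszcz"]
documents = ["Dziś pada deszcz i wieje wiatr.", "Kot śpi na kanapie."]
relevant = [1, 0]

score = retrieval_accuracy(toy_embed, queries, documents, relevant)
print(f"top-1 retrieval accuracy: {score:.2f}")
```

Swapping `toy_embed` for a function that wraps a real model gives a comparable number for each model, which is the kind of side-by-side comparison a benchmark automates across many tasks and datasets.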

Tags: Polish NLP, model evaluation, text embedding, natural language processing, machine learning research
No license, no package, no dependents
Maintenance 6 / 25
Adoption 5 / 25
Maturity 8 / 25
Community 8 / 25

Stars: 9
Forks: 1
Language: Python
License: none listed
Last pushed: Dec 18, 2025
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/embeddings/rafalposwiata/pl-mteb"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
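The same request can be made from Python with only the standard library. This is a rough sketch under two assumptions that the page does not confirm: that the endpoint returns a JSON body, and that other repositories follow the same `owner/repo` URL pattern. The helper names `build_url` and `fetch_quality` are made up for this example, and no response fields are assumed.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality/embeddings"


def build_url(owner: str, repo: str) -> str:
    """Build the endpoint URL for one repository (URL pattern is an assumption)."""
    return f"{BASE_URL}/{quote(owner, safe='')}/{quote(repo, safe='')}"


def fetch_quality(owner: str, repo: str, timeout: float = 10.0):
    """Fetch the quality record and parse it, assuming the body is JSON."""
    request = urllib.request.Request(
        build_url(owner, repo),
        headers={"Accept": "application/json"},
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_quality("rafalposwiata", "pl-mteb")` issues the same GET request as the curl command. With the keyless limit of 100 requests/day, it is worth caching the result locally rather than fetching it on every run.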