rafalposwiata/pl-mteb

PL-MTEB: Polish Massive Text Embedding Benchmark

/ 100

Experimental

This project helps machine learning engineers and researchers evaluate how well different language models understand and process Polish text. It takes in a pre-trained Polish text embedding model and outputs a comprehensive performance score across various tasks like text classification, clustering, and semantic search, allowing users to compare models efficiently. It's designed for those who build or deploy NLP solutions for the Polish language.

Use this if you need to rigorously compare the effectiveness of different text embedding models for Polish language applications.

Not ideal if you are working with languages other than Polish or if you need a tool for training new text embedding models from scratch.

Polish NLP model evaluation text embedding natural language processing machine learning research

No License No Package No Dependents

Maintenance 6 / 25

Adoption 5 / 25

Maturity 8 / 25

Community 8 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

—

Featured in

Embeddings Are Easier Than Whatever You're Doing Instead You're Shipping AI You Can't Measure

Higher-rated alternatives

embeddings-benchmark/mteb

MTEB: Massive Text Embedding Benchmark

harmonydata/harmony

The Harmony Python library: a research tool for psychologists to harmonise data and...

yannvgn/laserembeddings

LASER multilingual sentence embeddings as a pip package

embeddings-benchmark/results

Data for the MTEB leaderboard

Hironsan/awesome-embedding-models

A curated list of awesome embedding models tutorials, projects and communities.

Explore Embedding Tools

All categories Trending Embeddings directory Insights