embeddings-benchmark/mteb

MTEB: Massive Text Embedding Benchmark

Quality score: 86 / 100 (Verified)

This tool helps machine learning engineers and researchers assess the quality and performance of different text embedding models. You provide a text embedding model and specific evaluation tasks (like text classification or retrieval). The output is a clear set of metrics showing how well the model performs on those tasks, allowing for informed comparison and selection of the best model.

3,159 stars. Used by 6 other packages. Actively maintained with 107 commits in the last 30 days. Available on PyPI.

Use this if you need to systematically compare and evaluate multiple text embedding models against standardized benchmarks to choose the most effective one for your application.

Not ideal if you are looking for a tool to train new text embedding models or to apply embeddings directly in a production system without prior evaluation.
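The workflow the description sketches (model in, task metrics out) looks roughly like this with MTEB's Python API. A minimal sketch, assuming `mteb` and `sentence-transformers` are installed; the model name and task name below are illustrative example choices, not recommendations:

```python
# Hedged sketch of MTEB's basic evaluation loop: wrap an embedding
# model, pick one or more benchmark tasks, and run the evaluation.
# Datasets are downloaded on first run, so this needs network access.
from mteb import MTEB
from sentence_transformers import SentenceTransformer

# Any object exposing encode(list_of_texts) -> embeddings can be evaluated.
model = SentenceTransformer("all-MiniLM-L6-v2")  # example model

# Select tasks to benchmark on (classification, retrieval, STS, ...).
evaluation = MTEB(tasks=["Banking77Classification"])  # example task

# Per-task scores (accuracy, F1, ...) are returned and also written
# as JSON files under the given output folder.
results = evaluation.run(model, output_folder="results/all-MiniLM-L6-v2")
```

Comparing models is then a matter of rerunning the same task list with a different model and diffing the resulting JSON scores.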

Tags: natural-language-processing, model-evaluation, text-embeddings, information-retrieval, machine-learning-research
Maintenance 22 / 25
Adoption 15 / 25
Maturity 25 / 25
Community 24 / 25


Stars: 3,159
Forks: 568
Language: Python
License: Apache-2.0
Last pushed: Mar 12, 2026
Commits (30d): 107
Dependencies: 13
Reverse dependents: 6

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/embeddings/embeddings-benchmark/mteb"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.