Paulescu/text-embedding-evaluation

Join 15k builders to the Real-World ML Newsletter ⬇️⬇️⬇️

23
/ 100
Experimental

This tool helps machine learning practitioners or data scientists choose the best text embedding model for their specific retrieval tasks. You input your dataset of questions, contexts, and correct answers, along with a list of candidate embedding models. The tool outputs an evaluation of how well each model retrieves relevant information, helping you identify the top performer for your application.

No commits in the last 6 months.

Use this if you need to reliably find the most relevant documents for a given query and want to ensure you're using the optimal text embedding model for that purpose.

Not ideal if your primary use case for embeddings is classification or clustering, rather than information retrieval.

information-retrieval machine-learning-engineering text-analytics data-science document-search
No License Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 6 / 25
Maturity 8 / 25
Community 9 / 25

How are scores calculated?

Stars

18

Forks

2

Language

Python

License

Last pushed

Apr 19, 2024

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/embeddings/Paulescu/text-embedding-evaluation"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.