worldbank/GISTEmbed

GISTEmbed: Guided In-sample Selection of Training Negatives for Text Embeddings

23
/ 100
Experimental

This project helps developers fine-tune text embedding models to better understand the relationships between pieces of text. It takes a collection of text data and an existing embedding model, then produces a more accurate, specialized embedding model. This is used by machine learning engineers or NLP researchers who want to improve the performance of their text-based AI applications.

No commits in the last 6 months.

Use this if you are a machine learning engineer or NLP researcher who needs to fine-tune a text embedding model for better performance on specific retrieval or classification tasks.

Not ideal if you are looking for a ready-to-use embedding model for general-purpose tasks without any custom fine-tuning.

text-embeddings NLP-model-tuning information-retrieval-engineering semantic-search-development
No License Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 8 / 25
Maturity 8 / 25
Community 7 / 25

How are scores calculated?

Stars

44

Forks

3

Language

Python

License

Last pushed

Mar 06, 2024

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/embeddings/worldbank/GISTEmbed"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.