xlang-ai/instructor-embedding
[ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddings
This tool helps anyone working with text data to get better insights by converting plain text into numerical representations. You input your text along with a clear instruction about its purpose (e.g., "Represent this legal document for contract review"), and it outputs tailored numerical embeddings. Researchers, data analysts, or anyone needing to analyze large volumes of text for specific tasks would find this valuable.
2,023 stars. No commits in the last 6 months.
Use this if you need to transform text into numerical data for tasks like classifying documents, finding similar articles, or grouping related content, without needing to fine-tune a model for each new task.
Not ideal if you only need a generic text representation and don't require the flexibility of specifying a task or domain.
Stars
2,023
Forks
156
Language
Python
License
Apache-2.0
Category
Last pushed
Jan 15, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/embeddings/xlang-ai/instructor-embedding"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
ContextualAI/gritlm
Generative Representational Instruction Tuning
liuqidong07/LLMEmb
[AAAI'25 Oral] The official implementation code of LLMEmb
hpcaitech/CachedEmbedding
A memory efficient DLRM training solution using ColossalAI
ritesh-modi/embedding-hallucinations
This repo shows how foundational model hallucinates and how we can fix such hallucinations using...
ritesh-modi/fine-tuning-embeddings-template
This repo is a template to fine-tune embedding models using sentencetransformers based on...