JhaAyush01/Embedding-Quantization-for-Significantly-Faster-Cheaper-Retrieval
Unofficial Implementation of Binary and Scalar Embedding Quantization for Significantly Faster & Cheaper Retrieval and Evaluation of RAG system using "SEMALEX" evaluation metric .
No commits in the last 6 months.
Stars
—
Forks
—
Language
Jupyter Notebook
License
MIT
Category
Last pushed
Aug 17, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/embeddings/JhaAyush01/Embedding-Quantization-for-Significantly-Faster-Cheaper-Retrieval"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
xhluca/bm25s
Fast lexical search implementing BM25 in Python
ALucek/QuicKB
Optimize Document Retrieval with Fine-Tuned KnowledgeBases
Rohith-2/bm25-fusion
An ultra-fast BM25 retriever with support for multiple variants and meta-data filtering.
analyticsinmotion/symrank
🐍📦 High-performance cosine similarity ranking for Retrieval-Augmented Generation (RAG) pipelines.
MukundaKatta/HybridFind
Hybrid semantic + keyword search — BM25 and vector similarity with Reciprocal Rank Fusion