sahilfaizal01/Large-Scale-Vector-DB
This project focuses on developing a scalable vector database system optimized for dense passage retrieval, leveraging the MS-MARCO v2 dataset containing 8.84 million passages. Using advanced big data technologies and vector indexing algorithms, the system addresses challenges of retrieval latency and accuracy in large-scale datasets.
No commits in the last 6 months.
Stars: —
Forks: —
Language: Python
License: Apache-2.0
Category:
Last pushed: Dec 21, 2024
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/vector-db/sahilfaizal01/Large-Scale-Vector-DB"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
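The same request can be made from Python. The sketch below is a minimal example using only the standard library, not an official client. It assumes the endpoint path follows the pattern category/owner/repository (inferred from the single URL shown above) and that the response body is JSON; neither is confirmed by this page. The names build_quality_url and fetch_quality are hypothetical helpers introduced for illustration.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path copied from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Assemble the endpoint URL from its three path segments.

    Assumption: the path layout is <category>/<owner>/<repo>, inferred
    from the one example URL on this page.
    """
    path = "/".join(quote(part, safe="") for part in (category, owner, repo))
    return BASE_URL + "/" + path


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the record for one repository.

    Assumption: the response body is JSON. If it is not,
    json.loads raises a ValueError.
    """
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


if __name__ == "__main__":
    # Performs a real network request and counts against the daily limit.
    data = fetch_quality("vector-db", "sahilfaizal01", "Large-Scale-Vector-DB")
    print(json.dumps(data, indent=2))
```

Each call counts against the daily request limit, so a script that polls many repositories may want to cache results locally.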
Higher-rated alternatives
lancedb/lancedb
Developer-friendly OSS embedded retrieval library for multimodal AI. Search More; Manage Less.
zilliztech/VectorDBBench
Benchmark for vector databases.
qdrant/vector-db-benchmark
Framework for benchmarking vector search engines
prrao87/lancedb-study
Comparing LanceDB and Elasticsearch for full-text search and vector search performance
vector-index-bench/vibe
Vector Index Benchmark for Embeddings (VIBE) is an extensible benchmark for approximate nearest...