thustorage/GustANN

High-Throughput, Cost-Effective Billion-Scale Vector Search with a Single GPU [to appear in SIGMOD'26]

/ 100

Emerging

GustANN helps data scientists and machine learning engineers quickly find the most similar items (vectors) among billions of possibilities. You provide a dataset of numerical vectors and a query vector, and it efficiently returns the top 'k' most similar vectors, even with massive datasets that don't fit into GPU memory. This is ideal for applications requiring ultra-fast, large-scale similarity searches.

Use this if you need to perform high-throughput, billion-scale similarity searches on vector datasets with a single GPU, where the dataset is too large to fit entirely in GPU memory.

Not ideal if your datasets are small, you don't have access to high-end GPU and SSD hardware, or you require a ready-to-use solution without significant system-level setup.

vector-search similarity-matching large-scale-data machine-learning-inference data-retrieval

No License No Package No Dependents

Maintenance 10 / 25

Adoption 6 / 25

Maturity 7 / 25

Community 15 / 25

How are scores calculated?

Stars

Forks

Language

Cuda

License

—

Higher-rated alternatives

superlinked/sie

Superlinked Inference Engine is an Open-source inference server and production cluster for...

unohee/OpenSwarm

OpenSwarm — Autonomous AI dev team orchestrator powered by Claude Code CLI. Discord control,...

BlackyDrum/chromadb-ui

Modern ChromaDB web UI for browsing collections, inspecting records, running queries, and...

qdrant/skills

Agent skills for Qdrant vector search: scaling, performance optimization, search quality,...

DopamineDriven/slipstream

byok multi-provider medium supporting OpenAI, Anthropic, xAI, Google, Meta, & Mistral models...

Explore Vector Databases

All categories Trending Vector Database directory Insights