mindtro/semafold
Vector compression with TurboQuant codecs for embeddings, retrieval, and KV-cache. 10x compression, pure NumPy core — optional GPU acceleration via PyTorch (CUDA/MPS) or MLX (Metal).
This project helps AI infrastructure teams reduce the storage footprint of numerical AI data. It takes large embedding vectors or key-value tensors and produces significantly smaller compressed versions while preserving data quality. It is aimed at engineers and architects building AI systems who need to manage large volumes of AI-specific data efficiently.
Use this if you need to dramatically shrink your AI embeddings or KV-cache tensors to save storage and improve performance in your AI infrastructure.
Not ideal if you are looking to summarize text or reduce token counts in natural language processing tasks: it focuses purely on numerical data compression.
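To make the idea concrete, here is a minimal NumPy sketch of the general technique the description refers to. This is plain per-vector int8 scalar quantization for illustration only, not the TurboQuant codec or semafold's actual API; the function names are hypothetical. It shows how float32 embeddings shrink roughly 4x while reconstruction error stays small (the advertised 10x requires more aggressive codecs).

```python
import numpy as np

# Illustrative scalar quantization (NOT the TurboQuant codec itself):
# compress float32 embeddings to int8 with one float32 scale per vector.
def quantize(vectors: np.ndarray):
    scale = np.abs(vectors).max(axis=1, keepdims=True) / 127.0
    scale = np.where(scale == 0, 1.0, scale)  # avoid divide-by-zero on all-zero rows
    q = np.round(vectors / scale).astype(np.int8)
    return q, scale.astype(np.float32)

def dequantize(q: np.ndarray, scale: np.ndarray) -> np.ndarray:
    # Reconstruct approximate float32 vectors from int8 codes.
    return q.astype(np.float32) * scale

rng = np.random.default_rng(0)
emb = rng.standard_normal((1000, 768)).astype(np.float32)

q, scale = quantize(emb)
recon = dequantize(q, scale)

# Compression ratio counts both the int8 codes and the per-vector scales.
ratio = emb.nbytes / (q.nbytes + scale.nbytes)
err = np.abs(emb - recon).max()
print(f"compression ratio: {ratio:.2f}x, max abs error: {err:.4f}")
```

The reconstruction error is bounded by half the per-vector scale, which is why quality degrades gracefully as compression gets more aggressive.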
Stars: 17
Forks: 1
Language: Python
License: Apache-2.0
Category:
Last pushed: Apr 01, 2026
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/vector-db/mindtro/semafold"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
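For scripted access, the same endpoint can be called from Python. This is a hedged sketch assuming only what the curl line above shows (a GET request to that URL, no key required); the response schema is not documented here, so the network call is left commented out.

```python
import json
import urllib.request

# Endpoint from the curl example above; the path layout
# (/quality/vector-db/<owner>/<repo>) is taken from that one URL.
BASE = "https://pt-edge.onrender.com/api/v1/quality/vector-db"
repo = "mindtro/semafold"
url = f"{BASE}/{repo}"

# Keyless access is limited to 100 requests/day per the note above.
# Uncomment to fetch live data:
# with urllib.request.urlopen(url) as resp:
#     data = json.load(resp)
#     print(json.dumps(data, indent=2))
print(url)
```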
Higher-rated alternatives
superlinked/sie
Superlinked Inference Engine is an open-source inference server and production cluster for...
unohee/OpenSwarm
OpenSwarm — Autonomous AI dev team orchestrator powered by Claude Code CLI. Discord control,...
BlackyDrum/chromadb-ui
Modern ChromaDB web UI for browsing collections, inspecting records, running queries, and...
qdrant/skills
Agent skills for Qdrant vector search: scaling, performance optimization, search quality,...
DopamineDriven/slipstream
BYOK multi-provider medium supporting OpenAI, Anthropic, xAI, Google, Meta, & Mistral models...