msoedov/vector_lake
S3 vector database for LLM Agents and RAG.
VectorLake is designed for machine learning engineers and data scientists who need to store and query high-dimensional data efficiently and cost-effectively. It helps manage the large sets of numerical 'embeddings' that come out of machine learning models. You can feed it these embeddings, and it allows you to quickly find similar data points, especially useful for applications like powering AI assistants or recommendation systems.
Available on PyPI.
Use this if you need a low-maintenance, scalable, and cost-effective way to store and query vast amounts of numerical embedding data for your machine learning applications, particularly when using S3 storage.
Not ideal if you're looking for a full-featured transactional database or if your primary need isn't handling large-scale vector similarity search.
Stars
57
Forks
4
Language
Python
License
MIT
Category
Last pushed
Jan 28, 2026
Commits (30d)
0
Dependencies
8
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/vector-db/msoedov/vector_lake"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
pixeltable/pixeltable
Data Infrastructure providing a declarative, incremental approach for multimodal AI workloads.
activeloopai/deeplake
Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store,...
superlinked/VectorHub
VectorHub is a free, open-source learning website for people (software developers to senior ML...
hhblaze/DBreeze
C# .NET NOSQL ( key value, object store embedded TextSearch SemanticSearch Vector layer ) ACID...
TileDB-Inc/TileDB-Vector-Search
Cloud-native vector similarity search and storage with efficient, serverless scale-out