philpher0x/vectrain
Vectrain is a high-performance, modular, plug-and-play RAG pipeline that ingests data, generates vector embeddings, and stores them in vector databases for semantic search, recommendations, and analytics.
This tool helps developers transform raw text data, like customer reviews or product descriptions, into numerical 'vector embeddings' that power smart search, recommendations, and analytics. It takes data streams from sources like Kafka or REST endpoints, processes them through an embedding model, and then stores these embeddings in vector databases. Developers or MLOps engineers would use this to build and manage the data backend for AI applications.
No commits in the last 6 months.
Use this if you are a developer building an application that needs to perform semantic search, power recommendations, or analyze large volumes of unstructured text data using vector embeddings.
Not ideal if you need a fully managed, no-code solution for generating embeddings or if your application requires a different set of data sources, embedding models, or vector databases than what is currently supported.
Stars
14
Forks
2
Language
Go
License
MIT
Category
Last pushed
Sep 26, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/vector-db/philpher0x/vectrain"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
pixeltable/pixeltable
Data Infrastructure providing a declarative, incremental approach for multimodal AI workloads.
activeloopai/deeplake
Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store,...
superlinked/VectorHub
VectorHub is a free, open-source learning website for people (software developers to senior ML...
hhblaze/DBreeze
C# .NET NOSQL ( key value, object store embedded TextSearch SemanticSearch Vector layer ) ACID...
TileDB-Inc/TileDB-Vector-Search
Cloud-native vector similarity search and storage with efficient, serverless scale-out