Server Embedding Tools

This list tracks 27 server tools. The highest-rated is byte5ai/palaia, which scores 49/100 and has 4 stars.

Get the projects as JSON (the example request sets limit=20, so it returns at most 20 of the 27):

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=embeddings&subcategory=server&limit=20"

The API is open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
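The same request can be made from a script. The following is a minimal Python sketch, assuming only the endpoint and the three query parameters shown in the curl example. The helper names `build_quality_url` and `fetch_quality` are hypothetical, and because the response schema is not described here, the decoded JSON is returned as-is. Whether the API accepts a `limit` above 20 is also an assumption that has not been verified.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Endpoint taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_quality_url(domain="embeddings", subcategory="server", limit=20):
    """Build the dataset URL with the same query parameters as the curl example."""
    query = urlencode({"domain": domain, "subcategory": subcategory, "limit": limit})
    return f"{BASE_URL}?{query}"


def fetch_quality(url, timeout=10):
    """Fetch the URL and decode the JSON body.

    The response schema is not documented in this listing, so the decoded
    object is returned without further interpretation.
    """
    with urlopen(url, timeout=timeout) as response:
        return json.load(response)


if __name__ == "__main__":
    url = build_quality_url()
    print(url)
    # Uncomment to make a real request (it counts toward the daily quota):
    # data = fetch_quality(url)
    # print(json.dumps(data, indent=2)[:500])
```

Keeping URL construction separate from the network call makes it easy to change `limit` or `subcategory` and to inspect the request before spending any of the daily quota.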

| # | Tool | Description | Score | Tier |
|---|------|-------------|-------|------|
| 1 | byte5ai/palaia | Palaia — Local, crash-safe memory for AI agents. Semantic vector search... | 49 | Emerging |
| 2 | ddickmann/vllm-factory | Production inference for encoder models - ColBERT, GLiNER, ColPali,... | 42 | Emerging |
| 3 | j33pguy/magi | MAGI — Multi-Agent Graph Intelligence. Universal memory server for AI... | 36 | Emerging |
| 4 | LLMSystems/TensorrtServer | A high-performance deep learning model inference server based on TensorRT,... | 32 | Emerging |
| 5 | abdullah85398/embedding-server | A high-performance, self-hosted, model-agnostic embedding service designed... | 30 | Emerging |
| 6 | alez007/yasha | Self-hosted, multi-model AI inference server. Run LLMs, TTS, STT,... | 29 | Experimental |
| 7 | michaelkrauty/mcp-docs | MCP server for document management — multi-format extraction, semantic... | 26 | Experimental |
| 8 | michaelkrauty/vector-core | Shared vector search infrastructure for MCP servers — embeddings, hybrid... | 26 | Experimental |
| 9 | amgix/amgix-server | Amalgam Index (Amgix) is an open-source hybrid search system | 26 | Experimental |
| 10 | thetenzinwoser/recall-mcp | Local semantic search MCP server for markdown docs and Granola meeting... | 25 | Experimental |
| 11 | entangelk/agent-memory-system-public | Hierarchical long-term memory architecture for AI assistants with MCP support | 24 | Experimental |
| 12 | Kenny1338/Librarian | Persistent memory for AI agents. Observes conversations, extracts facts via... | 23 | Experimental |
| 13 | Obelus-Labs-LLC/Nexus | Semantic codebase graph engine — MCP server for Claude Code | 22 | Experimental |
| 14 | kalikin-artem/proj-mcp-recall-md | Local semantic search for markdown notes — MCP server | 22 | Experimental |
| 15 | damiandelmas/flex-claudecode | Claude Code session search — flex module | 22 | Experimental |
| 16 | MBaranekTech/pdf-rag-mcp | MCP server for RAG over messy PDFs — semantic search, OCR, table extraction.... | 22 | Experimental |
| 17 | raspoli/mlx-serve | Local inference server for Apple Silicon — hot-swaps MLX models (LLM,... | 22 | Experimental |
| 18 | getreka/reka | Memory infrastructure for AI coding assistants — Claude Code plugin, MCP... | 22 | Experimental |
| 19 | apresai/2ndbrain | AI-native markdown knowledge base with semantic search, RAG, and MCP server | 22 | Experimental |
| 20 | CynepMyx/deja | Semantic search MCP server for Claude Code sessions. pip install dejasearch | 22 | Experimental |
| 21 | incommodious-southamericancountry546/ScrcpyForAndroid | Run an Android scrcpy client with mDNS auto-connect, wireless debug device... | 22 | Experimental |
| 22 | joeywhelan/clip-demo | Example usage of the Jina ClipV2 model from Elastic Inference Service | 22 | Experimental |
| 23 | websmartshubhamk/memorygraph | Entity-anchored, vector-searchable, salience-weighted graph memory MCP... | 22 | Experimental |
| 24 | john-hoe/vega-memory | Local-first memory server for AI tools and agents. Persistent cross-session... | 14 | Experimental |
| 25 | uday160386/production-ready-rag-solution | Cache-Augmented Generation with Context Engineering, Semantic Search,... | 14 | Experimental |
| 26 | AdelElo13/neuromcp | Semantic memory for AI agents — local-first MCP server with hybrid search,... | 14 | Experimental |
| 27 | bitsandbrainsai/enterprise-rag-multilingual-knowledge-engine | Enterprise-grade multilingual RAG knowledge engine implementing... | 11 | Experimental |