Self-Hosted Embedding Servers Embedding Tools

Deployable embedding API services that run locally or on your own infrastructure, providing OpenAI-compatible or custom endpoints. Does NOT include embedding models themselves, inference libraries, or managed embedding API providers.

There are 90 self-hosted embedding servers tools tracked. 2 score above 70 (verified tier). The highest-rated is FlagOpen/FlagEmbedding at 76/100 with 11,395 stars. 2 of the top 10 are actively maintained.

Get all 90 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=embeddings&subcategory=self-hosted-embedding-servers&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

# Tool Score Tier
1 FlagOpen/FlagEmbedding

Retrieval and Retrieval-augmented LLMs

76
Verified
2 qdrant/fastembed

Fast, Accurate, Lightweight Python library to make State of the Art Embedding

71
Verified
3 Blaizzy/mlx-embeddings

MLX-Embeddings is the best package for running Vision and Language Embedding...

66
Established
4 Merck/Sapiens

Sapiens is a human antibody language model based on BERT.

63
Established
5 amansrivastava17/embedding-as-service

One-Stop Solution to encode sentence to fixed length vectors from various...

53
Established
6 jkrukowski/swift-embeddings

Run embedding models locally in Swift using MLTensor.

52
Established
7 jina-ai/examples

Jina examples and demos to help you get started

51
Established
8 freelawproject/inception

Our microservice for generating embeddings from blocks of text

50
Established
9 IlyasMoutawwakil/py-txi

A Python wrapper around HuggingFace's TGI (text-generation-inference) and...

49
Emerging
10 minimaxir/imgbeddings

Python package to generate image embeddings with CLIP without PyTorch/TensorFlow

48
Emerging
11 simonw/llm-embed-jina

Embedding models from Jina AI

46
Emerging
12 lh0x00/lightweight-embeddings

LightweightEmbeddings is a fast, free, and unlimited API service for...

44
Emerging
13 ddangelov/RESTful-Top2Vec

Expose a Top2Vec model with a REST API.

44
Emerging
14 dayyass/muse-as-service

REST API for sentence tokenization and embedding using Multilingual...

43
Emerging
15 rag-wtf/open-text-embeddings

Open Source Text Embedding Models with OpenAI Compatible API

43
Emerging
16 josephrmartinez/recipe-dataset

Datasette tutorial. Calculate and query embeddings on 5,000 rows in a sqlite...

42
Emerging
17 LLukas22/tei-client

Convenience Client for Hugging Face Text Embeddings Inference (TEI) with...

42
Emerging
18 yuvrajangadsingh/vemb

httpie for embeddings. Embed text, images, audio, video, and PDFs from the...

39
Emerging
19 ART-Group-it/KERMIT

🐸 KERMIT - A lightweight library to encode and interpret Universal...

39
Emerging
20 jina-ai/jina-grep-cli

Semantic grep powered by Jina embeddings v5 (MLX on Apple Silicon)

39
Emerging
21 jina-ai/jina-sagemaker

Jina Embedding Models on AWS SageMaker

38
Emerging
22 ejaasaari/lemur

LEMUR reduces multi-vector retrieval for late interaction models such as...

38
Emerging
23 jakedahn/qwen3-embeddings-mlx

MLX-powered Qwen3 embedding server for Apple Silicon Macs. Features ...

37
Emerging
24 jina-ai/mlx-retrieval

Train embedding and reranker models for retrieval tasks on Apple Silicon with MLX

37
Emerging
25 struct-chat/embedding

Vector Embedding Server in under 100 lines of code

36
Emerging
26 toshsan/embedding-server

Drop in replacement for OpenAI's embedding API. Self Hosted.

36
Emerging
27 louisbrulenaudet/lemone-api

Lemone: the API for french tax law and embeddings computation 🇫🇷

35
Emerging
28 MindHackingHappiness/EI-harness-lite

Light Python 3.x+ wrapper for our MHH EI_for_AI super prompt. Also js client.

34
Emerging
29 IvanCampos/openai-text-embedding

Uncover hidden connections and find the most semantically similar text to...

33
Emerging
30 dadoomer/sentence-transformers-server

Your own API endpoint to perform NLP functions like semantic search,...

33
Emerging
31 623637646/EmbeddedScrollView

Embedded UIScrollView for iOS.

33
Emerging
32 n24q02m/qwen3-embed

Lightweight ONNX inference for Qwen3 embedding and reranking models

33
Emerging
33 jina-ai/cli

All Jina AI APIs as Unix CLI commands. Search, read, embed, rerank - with pipes.

32
Emerging
34 dnys1/embedding_explorer

Experiment with text embedding models locally in your browser.

30
Emerging
35 noe/seqp

Sequence persistence library for Python

28
Experimental
36 dust-ai-mr/dust-nlp

Dust Actor library for interacting with LLMs and embedding engines

27
Experimental
37 Maplecoder18/Qwen3-VL-Embedding

🌟 Enhance visual and textual understanding with Qwen3-VL-Embedding and...

27
Experimental
38 aicubetechnology/aicube-embedding2embedding

AICUBE Embedding2Embedding - Unlock advanced embedding translation between...

27
Experimental
39 thinkbigcd/embedding-service

api service for generating and managing text embeddings

24
Experimental
40 artryazanov/embedding-service

This is a FastAPI-based service for generating text embeddings, supporting...

24
Experimental
41 dsjacobsen/embedding-service

A high-performance FastAPI service that generates vector embeddings for...

24
Experimental
42 Vokturz/fast-embeddings-api

fast-embeddings-api

23
Experimental
43 theseedship/n8n_embeddings_qwen3_integration

Use this advanced node (tool or embedding) for Qwen3 embeddings (fit all...

23
Experimental
44 fahmiaziz98/unified-embedding-api

A modular and open-source RAG-ready Embedding API supporting dense, sparse...

23
Experimental
45 startupradar/demo-find-similar-startups

Find similar startups with our API and OpenAI's embeddings

23
Experimental
46 elvatis/openclaw-gpu-bridge

OpenClaw plugin: Offload heavy compute (embeddings, BERTScore) to a remote GPU server

22
Experimental
47 bambara-martial/jina-grep-cli

Enable semantic grep and code search locally on Apple Silicon using Jina...

21
Experimental
48 ChasingBlu/CAIROS_Daemon

Python/ C/C++ embedding pipeline with a 2d-3d vector-coordinates converter....

21
Experimental
49 rogelioRuiz/dust-embeddings-swift

Standalone tokenizers and embedding runtime primitives for Dust — iOS/macOS

21
Experimental
50 rogelioRuiz/dust-embeddings-capacitor

On-device text embedding generation for iOS and Android via Capacitor

21
Experimental
51 cwccie/netembeddings

Pre-computed vector embeddings for networking concepts — RFCs, CLI commands,...

21
Experimental
52 enot-style/embeddings

OpenAI-compatible /v1/embeddings API for local Hugging Face text embedding...

21
Experimental
53 ethanlee928/mlx-embeddings-server

This package offers an OpenAI-compatible API server for mlx-embeddings

21
Experimental
54 ayinedjimi/CUDAEmbeddings

GPU-accelerated embedding server for RAG systems - CUDA, FastAPI,...

21
Experimental
55 enot-style/imbeddings

A minimal FastAPI service for generating image embeddings using Hugging Face...

21
Experimental
56 thiagosilvahyper/bihe-quantization

BIHE Protocol - Next-generation vector quantization combining E8 lattice...

21
Experimental
57 rogelioRuiz/dust-embeddings-kotlin

Standalone tokenizers and embedding runtime primitives for on-device text embeddings

21
Experimental
58 moda20/mes

Multimodal Embedding Service : This is a vibecodded application to serve as...

20
Experimental
59 devflowinc/openembeddings

Self-hostable pay for what you use embedding server for bge-large-en and...

19
Experimental
60 kemingy/mosec_emb

Embedding service with mosec that is compatible with OpenAI API.

19
Experimental
61 different-ai/embedbase-js

moved https://github.com/different-ai/embedbase/tree/main/sdk/embedbase-js

19
Experimental
62 AlwaysSany/huggingface-local-embedding

A Fast API server that provides local text and multi-modal embedding using...

18
Experimental
63 acantarero/embedding_service

FastAPI service to generate text embeddings. Currently supports instructor...

17
Experimental
64 Blase-Labs/blase

'blase' is a Python library that enables users to train neural networks...

17
Experimental
65 afriddev/EmbeRankis

EmbeRankis an open-source, production-ready service for embeddings and...

17
Experimental
66 rhangelxs/russian_embeddings

API server for word embeddings for Russian language

17
Experimental
67 kaovern/embeddrix

A stupid simple service to generate text embeddings

17
Experimental
68 MongoExpUser/Text-and-Image-Embeddings-for-PostgreSQL

Generate Text and Image Embeddings

17
Experimental
69 didinj/embeddings-and-vector-database-examples

Everything You Need to Know About Embeddings and Vector Databases

17
Experimental
70 TriDefender/jina-embedding-server

I rewrote the wheel so you don't have to pay for embed or rerank. The...

16
Experimental
71 aperepel/mlx-serve-embeddings

Local embeddings server for Apple Silicon using MLX, providing...

15
Experimental
72 bdllhdrss3/EmbedServe

A high-performance, local embedding service for running models like Qwen,...

14
Experimental
73 arterm-sedov/cmw-infinity

Infinity server setup and management for Infinity embedding and reranking...

14
Experimental
74 back2matching/turboquant-vectors

Compress embeddings 6x instantly with TurboQuant. First pip package using...

14
Experimental
75 CtrlAltElite-Devs/embedding.worker.faculytics

Embedding Worker for Faculytics 2.0

14
Experimental
76 K31NER/openai-embeddings-proxy

Proxy de embeddings compatible con la API de OpenAI en FastAPI que expone un...

13
Experimental
77 tubers9312345/mlx-serve-embeddings

🧠 Run local Apple Silicon embedding models with MLX, offering fast, private,...

13
Experimental
78 anvitha-sm/embedvisor

Embeddings web app + package + CLI in Python for data preprocessing:...

13
Experimental
79 SharvenRane/feature-store

Feature store implementation for image embeddings using Redis and Feast

13
Experimental
80 Alex-ML-labs/text-embedding-service-MLA-

FastAPI service for sentence embeddings & cosine similarity (MiniLM-L6-v2)....

13
Experimental
81 ggwozdz90/embed-api

API for text embeddings using BGE-M3 model. Supports dense, sparse, and...

13
Experimental
82 nakedcity/zephyr

OpenAI-compatible embedding server built on pure ONNX Runtime—fast starts,...

12
Experimental
83 yunwoong7/text-embedding-toolkit

A powerful toolkit for text chunking and semantic search using OpenSearch....

11
Experimental
84 loopyd/sd-embeddings-sync

Manage stable diffusion embeddings the draconic way

11
Experimental
85 kiyanmair/simple-model-service

Simple self-hosted embedding model service

11
Experimental
86 phymbert/e4b

Embeddings database in C/C++

11
Experimental
87 TtheBC01/openai-embeddings

demo of openai's embeddings api

11
Experimental
88 mashhurs/logstash-filter-embeddings_generator

Embeddings generator plugin for Logstash

11
Experimental
89 taqu/sequence-transformers-api

Embedding API server using Sequence Transformers

11
Experimental
90 seonglae/tei

Text Embeddings Inference (TEI)'s unofficial python wrapper library for...

10
Experimental