All Embedding Tools

4,115 tools ranked by quality score

Showing 1–100 of 4,115
# Tool Score Tier
1 embeddings-benchmark/mteb

MTEB: Massive Text Embedding Benchmark

86
Verified
2 trustgraph-ai/trustgraph

The context development platform. Store, enrich, and retrieve structured...

80
Verified
3 FlagOpen/FlagEmbedding

Retrieval and Retrieval-augmented LLMs

76
Verified
4 MinishLab/model2vec

Fast State-of-the-Art Static Embeddings

74
Verified
5 xhluca/bm25s

Fast lexical search implementing BM25 in Python

74
Verified
6 inception-project/inception

INCEpTION provides a semantic annotation platform offering intelligent...

73
Verified
7 srbhr/Resume-Matcher

Improve your resumes with Resume Matcher. Get insights, keyword suggestions...

72
Verified
8 typesense/typesense

Open Source alternative to Algolia + Pinecone and an Easier-to-Use...

71
Verified
9 qdrant/fastembed

Fast, Accurate, Lightweight Python library to make State of the Art Embedding

71
Verified
10 aiming-lab/SimpleMem

SimpleMem: Efficient Lifelong Memory for LLM Agents

70
Verified
11 vllm-project/semantic-router

System Level Intelligent Router for Mixture-of-Models at Cloud, Data Center and Edge

70
Verified
12 airweave-ai/airweave

Open-source context retrieval layer for AI agents

69
Established
13 dtsola/xiaoyaosearch

小遥搜索,听懂你的话、看懂你的图,用AI找到本地任何文件。让搜索像聊天一样简单。XiaoyaoSearch: Understands your...

67
Established
14 Blaizzy/mlx-embeddings

MLX-Embeddings is the best package for running Vision and Language Embedding...

66
Established
15 docarray/docarray

Represent, send, store and search multimodal data

66
Established
16 getzep/zep

Zep | Examples, Integrations, & More

65
Established
17 shibing624/text2vec

text2vec, text to vector....

65
Established
18 roshan-research/hazm

Persian NLP Toolkit

65
Established
19 aws-samples/amazon-bedrock-samples

This repository contains examples for customers to get started using the...

64
Established
20 brianpetro/obsidian-smart-connections

Chat with your notes & see links to related content with AI embeddings. Use...

64
Established
21 Anush008/fastembed-rs

Rust library for vector embeddings and reranking.

64
Established
22 shibing624/similarities

Similarities: a toolkit for similarity calculation and semantic search....

64
Established
23 gorse-io/gorse

AI powered open source recommender system engine supports classical/LLM...

63
Established
24 Merck/Sapiens

Sapiens is a human antibody language model based on BERT.

63
Established
25 lfnovo/esperanto

A unified interface for various AI model providers

63
Established
26 NotJoeMartinez/yt-fts

YouTube Full Text Search - Search all of YouTube from the command line

62
Established
27 cosmosgl/graph

GPU-accelerated force graph layout and rendering

62
Established
28 huggingface/text-embeddings-inference

A blazing fast inference solution for text embeddings models

62
Established
29 eliorc/node2vec

Implementation of the node2vec algorithm.

62
Established
30 cocoindex-io/cocoindex

Data transformation framework for AI. Ultra performant, with incremental...

62
Established
31 zilliztech/memsearch

A Markdown-first memory system, a standalone library for any AI agent....

61
Established
32 deepset-ai/haystack-core-integrations

Additional packages (components, document stores and the likes) to extend...

61
Established
33 Azure/azure-search-vector-samples

A repository of code samples for Vector search capabilities in Azure AI Search.

61
Established
34 justincasher/lean-explore

A search engine for Lean 4 declarations

60
Established
35 microsoft/simplechat

Secure AI conversations with documents, video, audio, and more. Personal...

60
Established
36 apocas/restai

RESTai is an AIaaS (AI as a Service) open-source platform. Built on top of...

60
Established
37 deepset-ai/haystack-tutorials

Here you can find all the Tutorials for Haystack 📓

60
Established
38 explosion/sense2vec

🦆 Contextually-keyed word vectors

59
Established
39 jparkerweb/semantic-chunking

🍱 semantic-chunking ⇢ semantically create chunks from large document for...

59
Established
40 ssrajadh/sentrysearch

Semantic search over videos using Gemini Embedding 2.

58
Established
41 jina-ai/clip-as-service

🏄 Scalable embedding, reasoning, ranking for images and sentences with CLIP

58
Established
42 predict-idlab/pyRDF2Vec

🐍 Python Implementation and Extension of RDF2Vec

58
Established
43 TorchDR/TorchDR

TorchDR - PyTorch Dimensionality Reduction

58
Established
44 Dadmatech/DadmaTools

DadmaTools is a Persian NLP tools developed by Dadmatech Co.

58
Established
45 Ryandonofrio3/osgrep

Open Source Semantic Search for your AI Agent

58
Established
46 MilaNLProc/contextualized-topic-models

A python package to run contextualized topic modeling. CTMs combine...

58
Established
47 yoanbernabeu/grepai

Semantic Search & Call Graphs for AI Agents (100% Local)

58
Established
48 lotus-data/lotus

AI-Powered Data Processing: Use LOTUS to process all of your datasets with...

58
Established
49 aryn-ai/sycamore

🍁 Sycamore is an LLM-powered search and analytics platform for unstructured data.

58
Established
50 Clay-foundation/model

The Clay Foundation Model - An open source AI model and interface for Earth

57
Established
51 michaelfeil/infinity

Infinity is a high-throughput, low-latency serving engine for...

57
Established
52 curiosity-ai/catalyst

🚀 Catalyst is a C# Natural Language Processing library built for speed....

57
Established
53 NyanNyanovich/nyan

Automatic news aggregator in Telegram / Автоматический агрегатор новостей в Телеграме

57
Established
54 gao-lab/GLUE

Graph-linked unified embedding for single-cell multi-omics data integration

57
Established
55 patrickfrank1/chesspos

Embedding based chess position search and embedding learning for chess positions

57
Established
56 AmenRa/retriv

A Python Search Engine for Humans 🥸

57
Established
57 unum-cloud/UForm

Pocket-Sized Multimodal AI for content understanding and generation across...

57
Established
58 neuml/annotateai

📝 Automatically annotate papers using LLMs

57
Established
59 infinilabs/coco-server

🥥 Coco AI Server - Search, Connect, Collaborate, AI-powered Enterprise...

56
Established
60 harmonydata/harmony

The Harmony Python library: a research tool for psychologists to harmonise...

56
Established
61 usc-isi-i2/kgtk

Knowledge Graph Toolkit

56
Established
62 EBISPOT/ols4

The EMBL-EBI Ontology Lookup Service (OLS)

56
Established
63 insideout10/wordlift-plugin

WordLift brings the power of Artificial Intelligence to beautifully organize...

56
Established
64 supabase/embeddings-generator

GitHub Action to generate embeddings from the markdown files in your repository.

56
Established
65 microsoft/kernel-memory

Research project. A Memory solution for users, teams, and applications.

56
Established
66 IntuitionEngineeringTeam/chars2vec

Character-based word embeddings model based on RNN for handling real world texts

56
Established
67 snap-stanford/stark

(NeurIPS D&B 2024) STaRK: Benchmarking LLM Retrieval on Textual and...

56
Established
68 derrickburns/generalized-kmeans-clustering

Production-ready K-Means clustering for Apache Spark with pluggable Bregman...

56
Established
69 Michael-JB/bm25

A BM25 embedder, scorer, and search engine, written in Rust.

56
Established
70 winkjs/wink-bm25-text-search

Fast Full Text Search based on BM25

56
Established
71 IITH-Compilers/IR2Vec

Implementation of IR2Vec, LLVM IR Based Scalable Program Embeddings

56
Established
72 MaartenGr/PolyFuzz

Fuzzy string matching, grouping, and evaluation.

56
Established
73 Accenture/AmpliGraph

Python library for Representation Learning on Knowledge Graphs...

56
Established
74 ddangelov/Top2Vec

Top2Vec learns jointly embedded topic, document and word vectors.

56
Established
75 MinishLab/model2vec-rs

Official Rust Implementation of Model2Vec

56
Established
76 zilliztech/GPTCache

Semantic cache for LLMs. Fully integrated with LangChain and llama_index.

56
Established
77 rom1504/clip-retrieval

Easily compute clip embeddings and build a clip retrieval system with them

56
Established
78 AnswerDotAI/ModernBERT

Bringing BERT into modernity via both architecture changes and scaling

55
Established
79 TeleAI-UAGI/telemem

TeleMem is a high-performance drop-in replacement for Mem0, featuring...

55
Established
80 unum-cloud/USearch

Fast Open-Source Search & Clustering engine × for Vectors & Arbitrary...

55
Established
81 nomic-ai/nomic

Nomic Developer API SDK

54
Established
82 snap-research/GRID

GRID: Generative Recommendation with Semantic IDs

54
Established
83 ContextualAI/gritlm

Generative Representational Instruction Tuning

54
Established
84 chakki-works/chakin

Simple downloader for pre-trained word vectors

54
Established
85 alexklibisz/elastiknn

Elasticsearch plugin for nearest neighbor search. Store vectors and run...

54
Established
86 RichmondAlake/memorizz

MemoRizz: A Python library serving as a memory layer for AI applications....

54
Established
87 tetherto/qvac

QVAC - Local AI SDK and libraries for building private, cross-platform,...

54
Established
88 towhee-io/towhee

Towhee is a framework that is dedicated to making neural data processing...

54
Established
89 vector-ai/vectorai

Vector AI — A platform for building vector based applications. Encode, query...

54
Established
90 Azure-Samples/azure-ai-document-processing-samples

A collection of samples demonstrating techniques for processing documents...

54
Established
91 probelabs/probe

AI-friendly semantic code search engine for large codebases. Combines...

53
Established
92 ascottbell/maasv

Memory Architecture as a Service — cognition layer for AI assistants....

53
Established
93 starthackHQ/Contextinator

Turning messy repos into weapons of mass structured context.

53
Established
94 milvus-io/milvus-model

A library integrating embedding and reranker models from OpenAI,...

53
Established
95 gmickel/gno

Local AI-powered document search and editing with first-in-class hybrid...

53
Established
96 voyage-ai/voyageai-python

Voyage AI Official Python Library

53
Established
97 LongxingTan/open-retrievals

All-in-One: Text Embedding, Retrieval, Reranking and RAG in Transformers

53
Established
98 awinml/voyage-embedders-haystack

Custom components for Haystack for creating embeddings and reranking...

53
Established
99 freedmand/semantra

Multi-tool for semantic search

53
Established
100 artitw/text2text

Text2Text Language Modeling Toolkit

53
Established
1 2 3 40 41 42 Next »