All Embedding Tools

4,115 tools ranked by quality score

Showing 201–300 of 4,115
# Tool Score Tier
201 OpenConceptLab/oclmap

OCL Mapper (beta): an open-source AI-supported terminology mapping solution...

48
Emerging
202 supabase/headless-vector-search

Supabase Toolkit to perform vector similarity search on your knowledge base...

47
Emerging
203 curiosity-ai/umap-sharp

C# library for fast embeddings projection using Uniform Manifold...

47
Emerging
204 D2KLab/entity2rec

entity2rec generates item recommendation using property-specific knowledge...

47
Emerging
205 DeepGraphLearning/graphvite

GraphVite: A General and High-performance Graph Embedding System

47
Emerging
206 mims-harvard/scikit-fusion

scikit-fusion: Data fusion via collective latent factor models

47
Emerging
207 Snehil-Shah/Multimodal-Image-Search-Engine

Text to Image & Reverse Image Search Engine built upon Vector Similarity...

47
Emerging
208 pdrm83/sent2vec

How to encode sentences in a high-dimensional vector space, a.k.a., sentence...

47
Emerging
209 mims-harvard/SHEPHERD

SHEPHERD: Few shot learning for phenotype-driven diagnosis of patients with...

47
Emerging
210 iamaziz/ar-embeddings

Sentiment Analysis for Arabic Text (tweets, reviews, and standard Arabic)...

47
Emerging
211 ina-foss/twembeddings

Sentence embeddings for unsupervised event detection in the Twitter stream:...

47
Emerging
212 finalfusion/finalfusion-rust

finalfusion embeddings in Rust

47
Emerging
213 revokslab/codecrawl

🌊 Turn entire codebases into LLM-ready data. Extract data, search, and...

47
Emerging
214 s-emanuilov/litepali

LitePali is a minimal, efficient implementation of ColPali for image...

47
Emerging
215 alexshtf/torchcurves

Parametric differentiable curves with PyTorch for continuous embeddings,...

47
Emerging
216 ProviderProtocol/ai

0-DEP AI DX SDK

47
Emerging
217 FullStackWithLawrence/openai-embeddings

OpenAI ChatGPT hybrid search and retrieval augmented generation

47
Emerging
218 MinishLab/tokenlearn

Pre-train Static Word Embeddings

47
Emerging
219 abojchevski/graph2gauss

Gaussian node embeddings. Implementation of "Deep Gaussian Embedding of...

47
Emerging
220 eugeneyan/semantic-ids-llm

Semantic IDs: How to train an LLM-Recommender Hybrid with steerability and...

46
Emerging
221 choihyunsus/n2-mimir

AI Experience Learning Engine - AI agents remember, but don't learn. Mimir...

46
Emerging
222 nomic-ai/semantic-search-app-template

Tutorial and template for a semantic search app powered by the Atlas...

46
Emerging
223 similigh/simili-bot

AI-powered GitHub issue intelligence - semantic duplicate detection,...

46
Emerging
224 PkuRainBow/HDC.caffe

Complete Code for "Hard-Aware-Deeply-Cascaded-Embedding"

46
Emerging
225 Agrover112/awesome-semantic-search

A curated list of awesome resources related to Semantic Search 🔎 and...

46
Emerging
226 oborchers/Fast_Sentence_Embeddings

Compute Sentence Embeddings Fast!

46
Emerging
227 aws-samples/sample-extreme-text-classifier

A Python text classifier for large-scale multi-class classification using...

46
Emerging
228 yusufhilmi/client-vector-search

A client side vector search library that can embed, store, search, and cache...

46
Emerging
229 nlpcloud/nlpcloud-js

NLP Cloud serves high performance pre-trained or custom models for NER,...

46
Emerging
230 JGalego/RAGmap

A simple Streamlit application to visualize document chunks and queries in...

46
Emerging
231 fresh-stack/freshstack

This repository helps you evaluate your models on the FreshStack benchmark!

46
Emerging
232 eugeneyan/ml-surveys

📋 Survey papers summarizing advances in deep learning, NLP, CV, graphs,...

46
Emerging
233 DeepChainBio/bio-transformers

bio-transformers is a wrapper on top of the ESM/Protbert model, trained on...

46
Emerging
234 Muvon/octolib

The lib to power AI tools.

46
Emerging
235 simonw/llm-embed-jina

Embedding models from Jina AI

46
Emerging
236 tca19/dict2vec

Dict2vec is a framework to learn word embeddings using lexical dictionaries.

46
Emerging
237 hayabhay/frogbase

Transform audio-visual content into navigable knowledge.

46
Emerging
238 aws-samples/news-clustering-and-summarization

This repository contains code for a near real-time news clustering and...

46
Emerging
239 decodingai-magazine/tabular-semantic-search-tutorial

📚 Tutorial on building a modern search app for Amazon e-commerce products...

46
Emerging
240 Zefan-Cai/R-KV

[NeurIPS 2025] R-KV: Redundancy-aware KV Cache Compression for Reasoning Models

46
Emerging
241 aiplanethub/beyondllm

Build, evaluate and observe LLM apps

46
Emerging
242 sede-open/Fleming

Fleming repo to run semantic search models on databricks on CPU.

46
Emerging
243 mims-harvard/SubGNN

Subgraph Neural Networks (NeurIPS 2020)

46
Emerging
244 raphaelsty/neural-cherche

Neural Search

46
Emerging
245 SeanLee97/AnglE

Train and Infer Powerful Sentence Embeddings with AnglE | 🔥 SOTA on STS and...

46
Emerging
246 mims-harvard/GraphXAI

GraphXAI: Resource to support the development and evaluation of GNN explainers

46
Emerging
247 lgalke/vec4ir

Word Embeddings for Information Retrieval

46
Emerging
248 langformers/langformers

🚀 Unified NLP Pipelines for Language Models

46
Emerging
249 Graphlet-AI/eridu

Deep fuzzy matching people and company names for multilingual entity...

46
Emerging
250 rom1504/image_embeddings

Using efficientnet to provide embeddings for retrieval

46
Emerging
251 build-on-aws/langchain-embeddings

This repository demonstrates the construction of a state-of-the-art...

46
Emerging
252 sacdallago/bio_embeddings

Get protein embeddings from protein sequences

46
Emerging
253 MilaNLProc/honest

A Python package to compute HONEST, a score to measure hurtful sentence...

46
Emerging
254 DmitryKey/bert-solr-search

Search with BERT vectors in Solr, Elasticsearch, OpenSearch and GSI APU

45
Emerging
255 bnosac/ruimtehol

R package to Embed All the Things! using StarSpace

45
Emerging
256 Addepto/graph_builder

Open-source toolkit to extract structured knowledge graphs from documents...

45
Emerging
257 haven-jeon/LegalQA

Korean LegalQA using SentenceKoBART

45
Emerging
258 SeekAI-786/Resume-Analyzer

Resume Analyzer is a prototype web application that allows users to upload...

45
Emerging
259 maxoodf/word2vec

word2vec++ is a Distributed Representations of Words (word2vec) library and...

45
Emerging
260 finalfusion/finalfusion-python

Finalfusion embeddings in Python

45
Emerging
261 etalab-ia/mediatech

Collection of public datasets from the French administration, vectorized and...

45
Emerging
262 veekaybee/what_are_embeddings

A deep dive into embeddings starting from fundamentals

45
Emerging
263 smeznar/HVAE

An approach for embedding hierarchical structures into a continuous vector...

45
Emerging
264 Terronex-dev/aifbin-pro

AIF-BIN Pro - Professional AI Memory Management with Semantic Search

45
Emerging
265 babylonhealth/fastText_multilingual

Multilingual word vectors in 78 languages

45
Emerging
266 ibm-self-serve-assets/Watson-NLP

This collection demonstrates how to help you to quickly embed Watson NLP in...

45
Emerging
267 HKUDS/XRec

[EMNLP'2024] "XRec: Large Language Models for Explainable Recommendation"

45
Emerging
268 EveripediaNetwork/fastc

Unattended Lightweight Text Classifiers with LLM Embeddings

45
Emerging
269 jared-goering/ultramemory

Local-first AI memory engine with relational versioning, temporal grounding,...

45
Emerging
270 persiyanov/skip-thought-tf

An implementation of skip-thought vectors in Tensorflow

45
Emerging
271 raphaelsty/cherche

Neural Search

45
Emerging
272 malllabiisc/cesi

WWW 2018: CESI: Canonicalizing Open Knowledge Bases using Embeddings and...

45
Emerging
273 drittich/SemanticSlicer

🧠✂️ SemanticSlicer - A smart text chunker for LLM-ready documents.

45
Emerging
274 xlang-ai/instructor-embedding

[ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddings

45
Emerging
275 wikipedia2vec/wikipedia2vec

A tool for learning vector representations of words and entities from Wikipedia

45
Emerging
276 noi-techpark/stuart-chatbot

Stuart is a simple RAG system that the Open Data Hub uses as a chatbot to...

45
Emerging
277 bnosac/doc2vec

Distributed Representations of Sentences and Documents

45
Emerging
278 sashakolpakov/graphem-rapids

Graph embedding for influence maximization in networks

45
Emerging
279 MaartenGr/VLAC

Vectors of Locally Aggregated Concepts

45
Emerging
280 UniverseTBD/platonic-universe

Do foundation models see the same sky? 🔮

45
Emerging
281 jeanCarloMachado/PythonSearch

A minimalistic search engine for productivity that stores documents as code

45
Emerging
282 Azure-Samples/azure-sql-db-session-recommender

Build a recommender using OpenAI, Azure Functions, Azure Static Web Apps,...

45
Emerging
283 Dicklesworthstone/frankensearch

Two-tier hybrid search for Rust: sub-millisecond initial results via...

45
Emerging
284 spring-petclinic/spring-petclinic-langchain4j

Spring Petclinic application with a chatbot powered by OpenAI's Generative...

45
Emerging
285 yuniko-software/bge-m3-onnx

ONNX implementation of the BGE-M3 multilingual embedding model and tokenizer...

45
Emerging
286 HKUDS/RLMRec

[WWW'2024] "RLMRec: Representation Learning with Large Language Models for...

44
Emerging
287 lh0x00/lightweight-embeddings

LightweightEmbeddings is a fast, free, and unlimited API service for...

44
Emerging
288 remete618/widemem-ai

Next-gen AI memory layer with importance scoring, temporal decay,...

44
Emerging
289 dmotz/emdash

📚🧙‍♂️ Wisdom indexer - use AI to organize text snippets so you can actually...

44
Emerging
290 sismetanin/sentiment-analysis-of-tweets-in-russian

Sentiment analysis of tweets in Russian using Convolutional Neural Networks...

44
Emerging
291 solygambas/python-openai-projects

13 projects using ChatGPT API, Whisper, Embeddings, and DALL-E with Python.

44
Emerging
292 biocentral/biocentral_server

Compute functionality for biocentral.

44
Emerging
293 gweidart/rs-bpe

A ridiculously fast Python BPE (Byte Pair Encoder) implementation written in Rust

44
Emerging
294 robert-mcdermott/embeddings_plot

A command line utility to create plots of word embeddings

44
Emerging
295 ddangelov/RESTful-Top2Vec

Expose a Top2Vec model with a REST API.

44
Emerging
296 autonomio/signs

A suite of tools for text preparation, vectorization and processing for deep...

44
Emerging
297 colonelwatch/abstracts-search

Semantic search engine indexing 110 million academic publications

44
Emerging
298 snash4/GAT2VEC

embedding attributed graphs

44
Emerging
299 itayzit/openai-async

A light-weight, asynchronous client for OpenAI API - text completion, image...

44
Emerging
300 snu-mllab/KVzip

[NeurIPS'25 Oral] Query-agnostic KV cache eviction: 3–4× reduction in memory...

44
Emerging