Semantic Search Applications Embedding Tools

Tools and applications implementing semantic search functionality across various domains (e-commerce, documentation, knowledge bases). Focuses on end-to-end search solutions using embeddings and vector databases. Does NOT include foundational embedding models, vector database infrastructure alone, or domain-specific RAG systems already categorized separately.

195 semantic search application tools are tracked. 10 score 50 or above (established tier). The highest-rated is deepset-ai/haystack-tutorials at 60/100 with 351 stars. 1 of the top 10 is actively maintained.

Get the projects as JSON (the example request sets limit=20)

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=embeddings&subcategory=semantic-search-applications&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
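The same request can be made from a script. The following is a minimal sketch using only the Python standard library. The helper names `build_url` and `fetch_projects` are hypothetical, and because the response schema is not described here, the decoded JSON is returned unchanged rather than parsed into fields. Whether the endpoint accepts a `limit` larger than 20 is an assumption that would need checking against the API itself.

```python
import json
import urllib.parse
import urllib.request

# Endpoint and query parameters are taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(domain="embeddings",
              subcategory="semantic-search-applications",
              limit=20):
    """Build the request URL (hypothetical helper, not part of any SDK)."""
    query = urllib.parse.urlencode(
        {"domain": domain, "subcategory": subcategory, "limit": limit}
    )
    return f"{BASE_URL}?{query}"


def fetch_projects(limit=20, timeout=10):
    """Fetch the listing and return the decoded JSON as-is.

    The response shape is not documented in this page, so no fields
    are assumed. Each call counts against the daily request quota.
    """
    with urllib.request.urlopen(build_url(limit=limit), timeout=timeout) as resp:
        return json.load(resp)


# Example (performs a network request, so it is left commented out):
# projects = fetch_projects(limit=20)
# print(json.dumps(projects, indent=2)[:500])
```

Keeping URL construction separate from the network call makes it easy to inspect the exact request before spending one of the 100 keyless requests per day.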

# Tool Score Tier
1 deepset-ai/haystack-tutorials

Here you can find all the Tutorials for Haystack 📓

60
Established
2 aryn-ai/sycamore

🍁 Sycamore is an LLM-powered search and analytics platform for unstructured data.

58
Established
3 MaartenGr/PolyFuzz

Fuzzy string matching, grouping, and evaluation.

56
Established
4 unum-cloud/USearch

Fast Open-Source Search & Clustering engine × for Vectors & Arbitrary...

55
Established
5 towhee-io/towhee

Towhee is a framework that is dedicated to making neural data processing...

54
Established
6 pingcap/pytidb

TiDB AI SDK: Unified Multi-Modal Data Platform for AI Apps & Agents -...

52
Established
7 deepset-ai/haystack-demos

Fully working applications that demonstrate how to use Haystack to implement...

52
Established
8 pinecone-io/pinecone-datasets

An open-source dataset library for pre-embedded dataset: create your own...

51
Established
9 hamelsmu/code_search

Code For Medium Article: "How To Create Natural Language Semantic Search for...

51
Established
10 towhee-io/examples

Analyze the unstructured data with Towhee, such as reverse image search,...

50
Established
11 AshokHub/locBLAST

Local NCBI BLAST+ Search

48
Emerging
12 datastax/astra-db-java

Java Client for Data API

48
Emerging
13 nomic-ai/semantic-search-app-template

Tutorial and template for a semantic search app powered by the Atlas...

46
Emerging
14 decodingai-magazine/tabular-semantic-search-tutorial

📚 Tutorial on building a modern search app for Amazon e-commerce products...

46
Emerging
15 raphaelsty/neural-cherche

Neural Search

46
Emerging
16 IBM/Semantic-Search-for-Sustainable-Development

Semantic Search for Sustainable Development is experimental code for...

43
Emerging
17 gentaiscool/distfuse

A library to calculate similarity scores between two collections of text...

43
Emerging
18 karlopintaric/omop-concept-automapper

An automated system for mapping source medical concepts to OMOP standard...

43
Emerging
19 DeveloperMindset-com/faiss-mobile

FAISS library compiled for iOS, macOS, tvOS, watchOS

43
Emerging
20 Azure-Samples/eShopLite-SemanticSearch

eShopLite - Semantic Search is a reference .NET application implementing an...

42
Emerging
21 agrawal-rohit/stackoverflow-semantic-search

Word2Vec encodings based search engine for Stackoverflow questions

41
Emerging
22 jbmiller10/semantik

Semantik is a self-hosted semantic search engine for your documents.

40
Emerging
23 machinelearningZH/ai-search_staatsarchiv

Intelligent Document Search for the Staatsarchiv Zurich.

40
Emerging
24 kbeaugrand/SemanticKernel.Connectors.Memory.SqlServer

SQL Server connector for Semantic Kernel plugin and Kernel Memory

40
Emerging
25 augustwester/searchthearxiv

The code powering searchthearxiv.com, a simple semantic search engine for...

40
Emerging
26 Bklieger/Semantic

SemanticPDF: Drag, Drop, Semantic Search - SemanticPDF is a simple,...

39
Emerging
27 LD-Reborn/embeddingsearch

An embeddings based search server written in C#

39
Emerging
28 Azure-Samples/eShopLite-SemanticSearch-AzureAISearch

eShopLite - Semantic Search is a reference .NET application implementing an...

39
Emerging
29 5am-code/ada-laravel

This package allows you to enhance your Laravel applications by seamlessly...

39
Emerging
30 raphaelsty/neural-tree

Tree-based indexes for neural-search

38
Emerging
31 huyhoang17/Semantic_Search

[DEPRECATED] Baseline Project for Semantic Searching

36
Emerging
32 vlados/laravel-related-content

Build related content links using vector embeddings and pgvector for Laravel

35
Emerging
33 0xku/information-retrieval

Neural information retrieval / Semantic search / Bi-encoders

33
Emerging
34 do-me/semantic-segment-explorer

In-browser tool to explore semantic similarity segmenting strategies by...

33
Emerging
35 soulteary/text-retrieval-example

Let's talk about text retrieval.

33
Emerging
36 novoselrok/codesnippetsearch

Neural bag of words code search implementation using PyTorch and data from...

33
Emerging
37 SnowLaboratory/Laravel-Mirror

Laravel recommendation engine - take your blog to the next level!

32
Emerging
38 SeanWong17/Semantic-Text-Deduplicator

A high-performance text deduplication tool based on Transformer models (such as BERT) and FAISS indexing, designed to handle semantic duplicates in large-scale corpora.

32
Emerging
39 sefgh-ai/hostSefgh

well this is for hosting sefgh in case main server is down

31
Emerging
40 dcarpintero/github-semantic-search

Semantic Search on Langchain Github Issues with Weaviate

31
Emerging
41 bhattbhavesh91/polyfuzz-string-matching-demo

Fuzzy string matching, grouping, and evaluation using PolyFuzz

31
Emerging
42 awaescher/oef

Ollama Embeddings Forwarder

30
Emerging
43 do-me/SDG-Analyzer

Frontend-only semantic similarity mapper for SDGs

30
Emerging
44 last-stand/webcrawler

Web Data Crawling and Vectorization

30
Emerging
45 PrithivirajDamodaran/SPLADERunner

Lightweight wrapper for the independent implementation of SPLADE++ models...

30
Emerging
46 hsm207/weaviate-txtai

An integration of the weaviate vector search engine with txtai

29
Experimental
47 mensonones/expo-vector-search

High-performance on-device vector search engine for Expo & React Native....

29
Experimental
48 Mojne/semantic-index

Lightweight, single-file vector database for experiments and small projects.

29
Experimental
49 amrohendawi/roberta-t5-faiss-semantic-search

This is a semantic search implementation using RoBERTa + T5 + FAISS

29
Experimental
50 digikar99/cl-docsearch

A tool to search documentation of lisp symbols in the current lisp image.

29
Experimental
51 Paulanerus/TextVariantExplorer

A tool designed for the exploration, analysis, and comparison of textual...

29
Experimental
52 bwconrad/manga-semantic-search

Search for a manga using a description of its story

29
Experimental
53 ofirbsh/ai_embedder_engine

AI Embedder Engine: An open-source Python engine for generating embeddings...

29
Experimental
54 Kauxtubh/pinecone

Experimenting with Pinecone as vector data continues to take center stage in...

28
Experimental
55 anakin87/who-killed-laura-palmer

Simple Question Answering system, based on data crawled from Twin Peaks...

28
Experimental
56 thinktecture-labs/semantic-kernel-semanticsearch

Example how to implement a question & answer flow using semantic search with...

27
Experimental
57 Navy10021/SLS

SLS : Neural Information Retrieval(IR)-based Semantic Search model

27
Experimental
58 galafis/Semantic-Search-Engine

Professional project by Gabriel Demetrios Lafis

26
Experimental
59 Huffon/semantic-search-faiss

Semantic Search using FAISS & ElasticSearch

26
Experimental
60 leungkimming/SK-DocumentSearch

Using Semantic Kernel to obtain answer from a PDF document, with embeddings...

26
Experimental
61 muazhari/semantic-search

Semantic Search with streamlit user interface

26
Experimental
62 easonlai/product_semantic_search_streamlit

This code repo demonstrates how to use the word embedding model from Azure...

25
Experimental
63 arnicas/simple-embedded-text-navigator

small embedded text app that uses fairy tales, with selection, to navigate...

25
Experimental
64 gaiaverseltd/semantic-turning-point-detector

The Semantic Turning Point Detector is a lightweight but powerful tool for...

25
Experimental
65 Trevato/csv_semantic_search

Semantic search app using streamlit and txtai.

24
Experimental
66 EdDuarte/semantic-graph

Semantic search web application with graph visualization in Django

24
Experimental
67 TolaniSilas/web3-semantic-search

The Web3 Semantic Search Project is a decentralized content retrieval system...

24
Experimental
68 machinelearningZH/ogd_ai-search

Semantic, lexical, multilingual search in your OGD metadata catalog.

24
Experimental
69 devicemxl/drs-backend

Experimental semantic search backend based on Deferred Re-computation Search...

24
Experimental
70 ElliotOne/nl-semantic-runbook-search

A local tool that finds the most relevant engineering runbooks using...

23
Experimental
71 Talina06/arxiv-semantic-search

Arxiv Semantic Search enables semantic search on arXiv research papers using...

23
Experimental
72 ashvardanian/SpaceV

Billion-scale Semantic Search dataset derived from Microsoft SpaceV for...

23
Experimental
73 easonlai/azure_openai_semantic_search_sample

This code repo demonstrates how to use the word embedding model from Azure...

23
Experimental
74 ireapps/ire-archive-backend

FastAPI backend for archive.ire.org

23
Experimental
75 svakulenk0/semantic_coherence

Measuring semantic (in)coherence in Ubuntu dialogue corpus using different...

23
Experimental
76 phifib/next-llm-app

nextjs template with langchain + typescript + zustand + tailwind

23
Experimental
77 hsm207/txtai-weaviate-docker-compose

A demo on how to integrate weaviate with txtai

23
Experimental
78 DHTMLX/gantt-semantic-search-ai-demo

Semantic search demo for DHTMLX Gantt - find tasks by meaning using embeddings

22
Experimental
79 gsavla6-hue/semantic-search-engine

Advanced semantic search engine with dense retrieval, cross-encoder...

22
Experimental
80 mashrulhaque/EasyAppDev.Blazor.AutoComplete

A high-performance, feature-rich AutoComplete component for Blazor...

22
Experimental
81 dgtlss/semantica

A Laravel package that enables semantic search using vector embeddings for...

22
Experimental
82 YertleTurtleGit/rajs

Semantic search for documents and transcription files inside your browser...

22
Experimental
83 aaronroman/semantic-search-langchain

Building a semantic search engine using LangChain and OpenAI

22
Experimental
84 venkatesh-hyper/paperlens

Semantic search engine for research papers — find papers by meaning, not keywords.

22
Experimental
85 r0hankrishnan/racquet-sem-search

[WIP] Hybrid search over tennis racquets (structured filters + semantic...

22
Experimental
86 ian-cowley/LinksAndMore-

Organize links and more

22
Experimental
87 ElliotOne/nl-embeddings-vector-search-feedback-triage-engine

A local-first C# semantic feedback triage engine using embeddings and cosine...

22
Experimental
88 codebased-sh/codebased

Embedded AI search engine for code

22
Experimental
89 DavidChen617/eShopX

A scalable e-commerce system built with Clean Architecture, CQRS, and modern...

22
Experimental
90 gsavla6-hue/vector-search-engine

High-performance semantic vector search engine with hybrid search, filtering...

22
Experimental
91 guilhermebkel/semantic-search-llm-ai

🤖 That's a simple LLM semantic search that implements RAG concepts with help...

21
Experimental
92 Phrase-in-Context/eval

EACL 2023

21
Experimental
93 nyo16/faiss_ex

Elixir NIF bindings for FAISS — Facebook's library for efficient similarity...

21
Experimental
94 kieranpcremin/semantic-search-using-FAISS

Semantic search engine for technical documents using SentenceTransformers...

21
Experimental
95 garystafford/twelve-labs-bedrock-opensearch-demo

How TwelveLabs AI Models on Amazon Bedrock and OpenSearch enable...

21
Experimental
96 reformetech/haystack

🛠️ Build powerful search systems effortlessly with Haystack, a framework for...

21
Experimental
97 dfeen87/Semantic-Dropdown-Search

Semantic Dropdown Search is a schema-driven, open-source framework for...

21
Experimental
98 manueljcmatos/neural-search

21
Experimental
99 Anshu-312/llm-semantic-search

A lightweight FastAPI service for ingesting documents, creating embeddings...

21
Experimental
100 garystafford/nova-mm-embedding-model-demo

Demonstrating the use of Amazon Nova Multimodal Embeddings and TwelveLabs...

21
Experimental
101 amitportal/doc-semantic-search

A semantic search application that allows users to upload documents and...

21
Experimental
102 petlukk/eavec

Fast vector similarity search. SIMD kernels via Eä, called from Python....

21
Experimental
103 DevExpress-Examples/asp-net-web-forms-grid-semantic-search

Integrates AI-powered semantic search into the ASP.NET Web Forms Grid View...

21
Experimental
104 iai-group/ecir2018-intents

Towards an Understanding of Entity-Oriented Search Intents - ECIR'18

21
Experimental
105 haolamnm/jneurite

A simple vector database indexer with Ollama written in Java

20
Experimental
106 lachlanjc/spatial-twitter-search

Prototype of embeddings-based search of Twitter likes/bookmarks on an infinite canvas

20
Experimental
107 ayanalamMOON/multilingual-search-engine

A modern semantic search engine for discovering songs and poems across...

20
Experimental
108 lambda-capture/semantic-search-api

🔎 Free API for Text-retrieval & Semantic Search of Macro Data for Quant Research

20
Experimental
109 montraydavis/SemanticKernel_SqliteVec_Example

In-depth demonstration of C# Semantic Kernel SQLiteVec Hybrid Search...

20
Experimental
110 claire-np/semantic-course-navigator

Semantic course search and role-based learning-path generation using SBERT...

20
Experimental
111 dannykd/zotsearch

UCI course discovery platform with NLP via OpenAI. Discover any of the 6,140...

20
Experimental
112 ahmedeldamaty20/Intelligent-Search-and-Insights-Engine

A production-ready e-commerce search and analytics platform built with .NET...

19
Experimental
113 do-me/cordis-semantic-search

A simple semantic search application for CORDIS running entirely in the browser

19
Experimental
114 nabito/hls

Human Localization Sensor Ontology (HLS)

19
Experimental
115 jonathanlimsc/arxiv-scout

A microservice API that allows you to query the latest Arxiv AI articles...

19
Experimental
116 do-me/copernicus-services-semantic-search

A basic semantic search app based on 834 entries from Copernicus Services Catalogue

19
Experimental
117 LiveisFpv/ALib

A system for semantic search of scientific publications across large...

19
Experimental
118 Dherya27/Retrieval-based-Object-Recognition-and-Reconstruction-via-VAE-and-FAISS

Developed a robust system for object recognition and reconstruction based on...

18
Experimental
119 fusion-jena/semantic-search-usability-analysis

Supplementary material for a usability evaluation of a semantic search for...

18
Experimental
120 fusion-jena/daisi-semantic-search

Semantic Search Extension for Dataset Search UI [Dai:Si]

18
Experimental
121 julian-8897/arxiv-semantic-search

An LLM-powered semantic search tool for arXiv papers using sentence...

18
Experimental
122 ddmitov/fupi

Serverless multilingual semantic search based on LanceDB

18
Experimental
123 GradientFlow-ai/eaas

NextJS-based frontend

18
Experimental
124 zack-zzq/TrueFuzzyMatch

TrueFuzzyMatch is a powerful tool for fuzzy matching material names between...

17
Experimental
125 massimobonanni/EmbeddingAnalyzer

This repository contains a console application that calculates the cosine...

17
Experimental
126 JMJuarez/modulo_pln_vf

Basic semantic search engine for specific phrases in Spanish using NLP and...

17
Experimental
127 JashT14/VectorVault

VectorVault - Offline Semantic Search Engine with GloVe Embeddings

17
Experimental
128 vedaant00/uhsr

UHSR (Unified Hyperbolic Spectral Retrieval) is a next-generation hybrid...

17
Experimental
129 Krixna-Kant/radiology-trust

A privacy-preserving AI system for secure radiology case retrieval and...

17
Experimental
130 yash-srivastava19/Semantic_Search

A jupyter notebook implementing semantic search(with visualization) using...

17
Experimental
131 Muhomorik/SemanticKernel-FundDocsQnA-dotnet-nextjs

AI-powered Q&A over investment fund factsheets (PRIIP/KID documents)....

16
Experimental
132 angelafeliciaa/xCreator

UGC Marketplace for X

16
Experimental
133 DareDev256/vector-vs-keyword-search

Side-by-side comparison of semantic vector search vs BM25 keyword search ...

16
Experimental
134 SaharZargarzadeh/semantic-movie-search-hackathon6

A Streamlit-based semantic search app for movies built with Pinecone,...

15
Experimental
135 Betawi10/Nyc311-Infrastructure-Hotspots-Google-Cloud-BigQuery-Hackathon-PoC

🛠️ Transform NYC 311 complaints into ranked infrastructure hotspots using...

15
Experimental
136 DFMERA/azure-ai-search-embeddings

Azure AI Search: How to optimize search for your RAG applications

15
Experimental
137 zaafira12/Semantic-news-search

"A scalable semantic search engine for news articles using transformer...

14
Experimental
138 aaryanved/originality-engine

A system to measure conceptual originality using semantic, structural, and...

14
Experimental
139 codebywiam/semantic-search-faiss

This project implements a semantic search pipeline using the 20 Newsgroups...

13
Experimental
140 viniciusfinger/NER-powered-semantic-search

Named Entity Recognition powered Semantic Search

13
Experimental
141 AalokBaxi/Local-Semantic-Search

Local-first hybrid semantic search using .NET 10, ONNX Runtime, and SIMD.

13
Experimental
142 Rolika15/holosemantic

🌐 Build a holistic semantic system for managing data and actor objects...

13
Experimental
143 r0hankrishnan/racket-semantic-search

(WIP) Using semantic search to find the right tennis racket from Tennis Warehouse.

13
Experimental
144 AutohostAI/langchain-vector-search

Serverless API for indexing and searching documents with LLM

13
Experimental
145 Disha-04/ai-semantic-search

AI-powered semantic search and Q&A application using LLMs, vector search,...

13
Experimental
146 Sanschinu95/maxcrawler

Research-Grade Async Web Crawler with AI Summarization, PDF Extraction,...

13
Experimental
147 Ayush-srivastava504/AI-Search-Engine

AI-powered search engine for technical articles using FastAPI,...

13
Experimental
148 eddiedunn/engram

Knowledge corpus service with vector search — FastAPI + PostgreSQL +...

13
Experimental
149 wmjg-alt/nlp-systems-review

NLP lessons with implementations of Information Retrieval algorithms, Data...

13
Experimental
150 fridalyf412/manwen-viewer

Streamlit tool for keyword/semantic search, transliteration, and AI...

13
Experimental
151 dappros/site_crawler

Site crawler used in Ethora platform as an option to import your specific...

13
Experimental
152 VasanthPrabahar/Distributed_E-Commerce_SemanticSearch_Platform-

Distributed e-commerce semantic search system built incrementally using...

13
Experimental
153 machnevegor/ssssearch

Static Site Semantic Search

13
Experimental
154 samiksengupta/laravel-ai-demo

A Demo showcasing use of Laravel AI SDK, embeddings and vector search

13
Experimental
155 Wesley-Nunes/news-semantic-search

A semantic search system for news articles using BBC News dataset and...

13
Experimental
156 gedankrayze/splade-rest-api

Memsplora - An in-memory SPLADE (SParse Lexical AnD Expansion) content...

13
Experimental
157 michelderu/vector-wildcard-search

Wildcard search for Cassandra - using semantic recall and lexical filtering

13
Experimental
158 ANSHIKA1220/cluster-aware-semantic-search

Cluster-aware semantic search system using embeddings, FAISS vector search,...

13
Experimental
159 Poonam7828/vectormind-semantic-search

AI-powered Semantic Search Engine using MongoDB, Sentence Transformers, and...

13
Experimental
160 clxmente/tuffysearch-api

🎓 A FastAPI-powered semantic search API for CSUF course catalog, enabling...

13
Experimental
161 AgesFranciscoTeran/ecuador-news-nlp-pipeline

Modular NLP pipeline for large-scale historical newspaper processing with...

13
Experimental
162 evokateur/what-was-that-word

A terminal program that embeds a word list and does semantic search over it.

13
Experimental
163 faraaawo-debug/ai-act-semantic-search

Semantic search system to retrieve relevant passages from the European AI Act

13
Experimental
164 mateus-holanda/kassa

Kassa Assessment: AI-powered Furniture Search

13
Experimental
165 primaryobjects/semantic-search

Semantic search web app using the Large Language Model (LLM) Cohere for...

13
Experimental
166 sreelakshmisajith05/trademarkia-semantic-search

Semantic search over 20 Newsgroups- fuzzy clustering, vector store &...

13
Experimental
167 ajeet214/elastic_semantic_search

An LLM-powered semantic search system that bridges Elasticsearch with Azure...

12
Experimental
168 Khushmeet-patil/Sementic-Search-task

A Streamlit-based semantic search engine that converts documents into...

12
Experimental
169 elango5292/sg-medishield-semantic-search

Semantic search for MediShield Life policy document

12
Experimental
170 danny-1k/autocomplete_hist

Autocomplete engine trained on my google history

11
Experimental
171 upstash/laravel-semantic-emoji

An example of how to use our Vector SDK to do semantic search on emoji data

11
Experimental
172 bard/grants-stack-search

Hybrid semantic (vector-based) / full-text search for Gitcoin projects

11
Experimental
173 FUYOH666/DT-xml

AI-powered semantic search system for customs declarations (EAEU). Helps...

11
Experimental
174 utyfua/react-paperform-co

Paperform.co components for react with types

11
Experimental
175 twelvelabs-io/tl-marengo-bedrock-s3

Multimodal AI embeddings with TwelveLabs Marengo on Amazon Bedrock. Generate...

11
Experimental
176 ThomasVitale/package-for-weaviate

Kubernetes-native package for Weaviate, an AI-native vector database that...

11
Experimental
177 tbrouns/artsbot

Semantic search NLP model for thuisarts.nl

11
Experimental
178 surajsrivathsa/thesis_deployment

Frontend and Backend for comic book semantic search engine. Renders...

11
Experimental
179 sun-wendy/semantic-search

Final project for 9.66 - Computational Cognitive Science (fall 2024)

11
Experimental
180 YahyaAlaaMassoud/learn-search-relevance

Exploring search relevance techniques.

11
Experimental
181 BerntA/IR-SMART

Semantic Answer Type Prediction

11
Experimental
182 Sharan-Babu/CycloneSearch

Search and Compare Cyclones

11
Experimental
183 hsm207/haystack-weaviate-docker-compose

How to configure Haystack to use Weaviate

11
Experimental
184 Aml-Hassan-Abd-El-hamid/The-finder

ML web-based system that can find similar products based on user inputs

11
Experimental
185 chrisammon3000/aws-open-data-registry-neural-search

Semantic search of AWS Open Data Registry datasets using Weaviate

11
Experimental
186 Natanaelvich/product-catalog-embeddings-node

A Node.js application that uses OpenAI embeddings to create a semantic...

10
Experimental
187 Yusuf-YENICERI/Multilingual-Semantic-Search

Semantic search with open source or openai

10
Experimental
188 MichaelMwb/LLM-Quote-Retrieval

Semantic search engine for ~500k quotes using SentenceTransformers...

10
Experimental
189 DimitrisLianos/Nyc311-Infrastructure-Hotspots-Google-Cloud-BigQuery-Hackathon-PoC

NYC311 Infra Hotspots: End-to-end BigQuery AI pipeline that turns 311...

10
Experimental
190 AndrMoura/songsearch

Song lyrics semantic search using Haystack.

10
Experimental
191 ansenya/cdc-demo

Demonstration of CDC (change data capture) and semantic search

10
Experimental
192 bilgeyucel/document-search-demo

Document Search Pipeline Using Haystack

10
Experimental
193 eyuuab/Semantic-Email-Tagging-System

Email Tagging system using Semantic search

10
Experimental
194 cjber/cdrc-semantic-search

The CDRC Semantic Search System is a project designed to enhance the search...

10
Experimental
195 davidbriangarcia/semantic-search-research

Research

10
Experimental

Comparisons in this category