Semantic Search Engines NLP Tools

Tools for building search systems that match semantic meaning and relevance using embeddings, neural networks, and dense/sparse retrieval methods. Does NOT include general information retrieval frameworks, traditional keyword-based search, or downstream NLP tasks like Q&A or summarization.

There are 51 semantic search engines tools tracked. 3 score above 50 (established tier). The highest-rated is smart-on-fhir/cumulus-etl at 60/100 with 22 stars.

Get all 51 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=nlp&subcategory=semantic-search-engines&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

# Tool Score Tier
1 smart-on-fhir/cumulus-etl

Extract FHIR data, Transform with NLP and DEID tools, and then Load FHIR...

60
Established
2 mirkosertic/FXDesktopSearch

A JavaFX based desktop search application.

57
Established
3 opensemanticsearch/open-semantic-search

Open Source research tool to search, browse, analyze and explore large...

51
Established
4 opensemanticsearch/open-semantic-etl

Python based Open Source ETL tools for file crawling, document processing...

49
Emerging
5 opensemanticsearch/open-semantic-search-apps

Python/Django based webapps and web user interfaces for search, structure...

46
Emerging
6 opensemanticsearch/open-semantic-entity-search-api

Open Source REST API for named entity extraction, named entity linking,...

46
Emerging
7 naver/splade

SPLADE: sparse neural search (SIGIR21, SIGIR22)

45
Emerging
8 AnthonySigogne/web-search-engine-ui

UI - a simple web search engine

41
Emerging
9 hannawong/ColXLM

Multilingual Retrieval on Yelp Search Engine ⚡

41
Emerging
10 davidemiceli/natural-language-flows

A very simple prototype of Natural Language Flow builder and Natural...

41
Emerging
11 RugvedMavidipalli/Search-Engine

Search Engine built using Java

39
Emerging
12 RameshAditya/scoper

Fuzzy and semantic search for captioned YouTube videos.

37
Emerging
13 o19s/hello-nlp

A natural language search microservice

37
Emerging
14 metehan777/google-rerank-tool

A Python cli-command tool for creating reports for any Google query.

35
Emerging
15 Leoglme/node-nlp-typescript

nlp.js from axa-group in typescript 🚀. NLP library for building bots 🤖, with...

35
Emerging
16 yiming-liao/zhq

完全運行於客戶端的中文檢索引擎

33
Emerging
17 lszoszk/UN-TreatyBodiesDocSearch

Application enabling to search through the General Comments/ Recommendations...

33
Emerging
18 cabeywic/knowledge-base-search

This project provides an efficient and scalable solution to search and query...

32
Emerging
19 george-gca/ai_papers_search_tool

Automatic paper clustering and search tool by fastext from Facebook Research

31
Emerging
20 jpoehnelt/related-documents

Find and rank text documents by similarity.

29
Experimental
21 lakshaychhabra/MLSearchEngine

This repo contains an NLP and ML based Search Engine for Stackoverflow Dataset.

29
Experimental
22 mdipietro09/App_StringsMatcher

String Matching Web App

28
Experimental
23 Yahia995/semantic-search-api

NLP-powered semantic document search using HuggingFace transformers and FAISS

25
Experimental
24 pradeep583/Search-It

A lightweight web search engine built using BM25 for keyword relevance, BERT...

24
Experimental
25 shreydan/youtube-in-video-search

YouTube Question-Answering and Semantic Search.

24
Experimental
26 czarinagluna/ml-powered-video-library

Machine learning-powered video library that returns accurate results given...

23
Experimental
27 MohammadMoataz2/KnowledgeKapture

KnowledgeKapture is an information retrieval system and search engine...

23
Experimental
28 IvanKotik/Word-cloud-Search-engine-optimisation-

Future project on search optimisation via NLP

23
Experimental
29 Anaskaysar/SciRet-Scientific-Information-Made-Easy

SciRet is a system that will retrieve authentic and informative data from a...

21
Experimental
30 altescy/tinysearch

🔍 Tiny python library for sparse/dense search

20
Experimental
31 thecloaq/cloaq-reranker

gRPC service that reranks documents by relevance

20
Experimental
32 Somespi/meliora

meliora is a command-line tool for sorting files based on their content. that's it.

19
Experimental
33 LLRHall/Astria

Astria - Intelligent Search Engine for Lawyers and Common people

19
Experimental
34 KvaytG/ru-wiki-search

Smart search on Russian Wikipedia.

19
Experimental
35 TelevisionNinja/search-engine

This is a basic search engine I made for my information retrieval class.

18
Experimental
36 deepindexer/deepi-wp

WordPress Plugin for Deepi Search. Upgrade your site's "lexical search" to...

17
Experimental
37 frans-johansson/code-query

Information retrieval on source code through natural language queries

17
Experimental
38 DevAsgari/ai-semantic-search-tool

Python-based semantic search tool using pretrained Sentence-BERT for vector...

15
Experimental
39 shruticreates01-ship-it/smart-search-ai

AI-powered natural language product search (demo + PRD + metrics framework)

15
Experimental
40 johannkm/goex-search

(Winner | Capital One) A Yelp search app that summarizes reviews using...

13
Experimental
41 IamOmaR22/Django-CRUD-and-TextUtils

CRUD Operations and Text-Utils in Django

12
Experimental
42 Hoaru/Academic-Search-Engine

A search engine

12
Experimental
43 RRFLV/project-search

Project Search is the code name for the search engine project in development...

11
Experimental
44 SwapnilVerma209/mini_search

An in-progress free and open source search engine.

11
Experimental
45 Enoch2090/MAGI

MAGI is an semantic searcher over GitHub.

11
Experimental
46 nico916/best_search_engine-

A "from-scratch" implementation of a search engine in Python. This project...

11
Experimental
47 fccapria/scientify

Modern platform for managing and sharing scientific publications 📚✨

11
Experimental
48 abhinav-bohra/CoeuSearch

Neural File Search Engine

11
Experimental
49 HannahIgboke/Semantic-Based-Video-Subtitle-Search-engine

Leveraged natural language processing and machine learning techniques to...

11
Experimental
50 al-alamin/StracerBot

This repository contains codes and documentation for research project...

10
Experimental
51 SakuraPuare/ApolloDatabase

Apollo 自动驾驶文档全文搜索平台 | Full-text search platform for Apollo autonomous...

10
Experimental