Uncategorized Embedding Tools

There are 294 uncategorized tools tracked. 2 score above 50 (established tier). The highest-rated is tetherto/qvac at 54/100 with 63 stars.

Get all 294 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=embeddings&subcategory=uncategorized&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

# Tool Score Tier
1 tetherto/qvac

QVAC - Local AI SDK and libraries for building private, cross-platform,...

54
Established
2 sysid/bkmr

A Unified CLI Tool for Bookmark, Snippet, and Knowledge Management

53
Established
3 cerul-ai/cerul

Real-time video search engine for AI agents. Search by meaning across visual...

49
Emerging
4 joshuaswarren/remnic

Local-first memory plugin for OpenClaw AI agents. LLM-powered extraction,...

49
Emerging
5 amarpatel-xx/generator-jhipster-ai-postgresql

JHipster blueprint that enhances entity relationships by displaying multiple...

42
Emerging
6 teranos/QNTX

QNTX = Experiential ꩜ Learning ⌬ System ≡ Attestation +

42
Emerging
7 MakiDevelop/knowledge-pipeline

Stop feeding your RAG garbage. A zero-framework, 6-layer deterministic...

41
Emerging
8 robzilla1738/Memorwise

A local, open-source alternative to NotebookLM. Chat with your documents...

40
Emerging
9 elbruno/ElBruno.LocalLLMs

C# local LLM chat completions library using ONNX Runtime, compatible with...

40
Emerging
10 BryanChasko/kiro-cli-notes

Professional Kiro CLI setup guide based on 10 tutorial videos with proper...

39
Emerging
11 alsoleg89/ai-knot

Agent knowledge layer — extract facts from conversations, store them,...

38
Emerging
12 zlaabsi/turboquant-wasm

TurboQuant vector quantization for browser and edge runtimes

38
Emerging
13 danilodevhub/turboquant-js

TypeScript implementation of Google's TurboQuant algorithm for near-optimal...

37
Emerging
14 lucidrains/discrete-continuous-embed-readout

Embedding and readout for simple multi-categorical and gaussian continuous

37
Emerging
15 ElieElDebs/Good-Karma

Good Karma is a SaaS that analyze your Reddit's Post and gives you KPI and Advices

37
Emerging
16 hackerlibs/rag-code-sorting-search

RAG code sorting search, RAG knowledge organization

37
Emerging
17 orkait/graphstore

Memory infrastructure for AI agents!! store, recall by meaning or...

36
Emerging
18 samzong/Recall

Local-first TUI for searching AI coding session history across Claude Code,...

35
Emerging
19 vulture-s/arkiv

Local-first media asset manager with AI-powered semantic search. DaVinci...

35
Emerging
20 rsasaki0109/rustclaw

Memory search engine with hybrid vector/keyword search, MMR re-ranking, and...

35
Emerging
21 elbruno/elbruno.localembeddings

.NET library for local embedding generation using ONNX and Microsoft.Extensions.AI

34
Emerging
22 retran/meowary

AI-powered work journal template for software developers. PARA structure,...

34
Emerging
23 Hyper3Labs/clawdrive

Google Drive for AI agents. Store any file and search by meaning across modalities.

34
Emerging
24 ZengLiangYi/ChatCrystal

Crystallize knowledge from AI conversations. Import from Claude Code /...

33
Emerging
25 LLMSystems/llm_tools

A comprehensive Python toolkit for LLM integration with chat, embeddings,...

33
Emerging
26 MushroomFleet/djz-Aesthetic-Embeddings

Curated aesthetic embedding collection for Stable Diffusion — style...

32
Emerging
27 amarpatel-xx/generator-jhipster-cassandra

JHipster blueprint for advanced Apache Cassandra support — composite primary...

31
Emerging
28 alez007/modelship

Self-hosted, multi-model AI inference server. Run LLMs, TTS, STT,...

30
Emerging
29 amarpatel-xx/jhipster-ai-postgresql-example

This code was generated using the JHipster blueprint...

29
Experimental
30 gloignon/ALSI

ALSI/ILSA is a lexical and syntactic analyzer

29
Experimental
31 Flissel/la_fungus_search

Semantic code search engine — FAISS, EmbeddingGemma, BM25, Streamlit UI

29
Experimental
32 amarpatel-xx/jhipster-cassandra-example

An example JHipster application generated with the...

29
Experimental
33 CloverIris/SeekStar

寻星SeekStar-互联网探索引擎 寻星 是一种面向未来的浏览器 / 搜索引擎形态构想。通过AI驱动的3D...

28
Experimental
34 Govcraft/crately

CLI tool for downloading, vectorizing, and semantically searching Rust crate...

28
Experimental
35 NithinGowda67/WebSage-AI-Scraper

WebSage is an AI-powered web scraping and analysis platform built with...

28
Experimental
36 villagesql/vsql-ai

AI prompting and text embeddings in MySQL via Claude, Gemini, OpenAI, Ollama

27
Experimental
37 spearzy/Axiom

Deterministic fluent assertions for .NET tests with batching, analyzers,...

27
Experimental
38 egwajnphoiu/Semantic-Gate-IP-Core

Enforce semantic integrity in LLM outputs with a hardware IP core that...

26
Experimental
39 dev-diaries41/sereleum-core

Core engine that powers the Sereleum app, providing the indexing, clustering...

26
Experimental
40 wh1le/better-web

Terminal tool for cutting through SEO and AI noise. Scrape, Score, Digest,...

26
Experimental
41 uchebnick/unch

Local-first semantic code search for repository annotations via GGUF...

26
Experimental
42 CH-RAFAY/Semantic-Book-Recommender

Semantic book recommendation system using LLM embeddings, zero-shot...

26
Experimental
43 MacPaw/ai-sdk-typescript

Official TypeScript SDK for AI Gateway - universal client for browser and...

26
Experimental
44 raimonvibe/chatbot-java-spring-ai

Christian AI Chatbots with Biblical Wisdom. Transform your ministry or...

26
Experimental
45 archmaxai/archmax

A semantic layer for your databases. Describe your data once, let AI agents...

26
Experimental
46 activeloopai/hivemind

Hivemind is a plugin for Claude Code, Codex and OpenClaw to add persistent...

25
Experimental
47 heitor-am/docsmith-agent

RAG agent with LangGraph for document intelligence — semantic search,...

25
Experimental
48 anetrebskii/llmdex

Local semantic search for your codebase. No API keys, no cloud. Built for...

25
Experimental
49 jayzeng/agentmemory

agentmemory: persistent memory for coding agents (Claude Code, OpenAI Codex,...

25
Experimental
50 modagavr/tomonome-knowledge-base

Structured marketing knowledge base optimized for AI, LLMs, agents, and...

25
Experimental
51 haripatel07/bugreport-ai

Production-ready full-stack AI debugging platform with FastAPI, React,...

25
Experimental
52 datobueno/goodpoint

Open-source tool to extract, organize, search, and cluster PDF highlights...

24
Experimental
53 nitin27may/e-commerce-agents

6 specialized AI agents collaborating via A2A protocol to power e-commerce —...

24
Experimental
54 bansal1806/DHUND

DHUND — AI-powered missing person recovery system using multimodal vision,...

24
Experimental
55 retospect/acatome-store

Paper storage backend for acatome — SQLite/Postgres, vector search, notes

24
Experimental
56 Imadberkani/sysrag

A modular retrieval-augmented pipeline for question answering on recent BBC News.

24
Experimental
57 sandhya-gunde/data-ingestion-kg-pipeline

AI-Based Knowledge Graph and Semantic Search System using TF-IDF, Neo4j, and...

24
Experimental
58 6v17/VideoSeek

Local-first semantic video search desktop app with CLIP + FAISS (text/image...

24
Experimental
59 sachinsharma9780/memweave

memweave is a zero-infrastructure, async-first Python library that gives AI...

24
Experimental
60 joungminsung/SemanticFS

FUSE-based semantic filesystem — access files by meaning, not paths. 자연어로...

24
Experimental
61 ivanzwb/agent-memory

TypeScript library providing persistent memory for AI agents — conversation...

24
Experimental
62 Aeslo/FileManager

Smart local file explorer that uses machine learning to automatically...

24
Experimental
63 myersm0/what-was-said

A framework for storage, retrieval, inference, and temporal reasoning over...

24
Experimental
64 Memact/Website

Official Website of Memact. (Still under construction)

24
Experimental
65 christianfitaram/api-semantic

Semantic API for embedding-based retrieval and vector similarity search,...

24
Experimental
66 urantia-hub/api.urantia.dev

API for the Urantia Papers — full-text search, semantic search, entities,...

24
Experimental
67 AD-Styles/nlp-semantic-search

"A Semantic Search system utilizing Sentence-Transformers and Cosine...

23
Experimental
68 nikuscs/scanr

📡 Semantic codebase search + TypeScript structural analysis — embeddings,...

23
Experimental
69 zlaabsi/turboquant-embed

TurboQuant embedding compression and RAG retrieval benchmarks

23
Experimental
70 fdsimoes-git/hate-speech-detector

Detect hate speech in video/audio using Whisper transcription, multilingual...

23
Experimental
71 fcys3258/file-search-agent

本地智能文件搜索系统 - 支持语义向量搜索、LLM 自动标签、GUI 界面

23
Experimental
72 MineProject17/sportguard-ai

🛡️ SportGuard AI – Semantic Sports Media Integrity Platform | Hack2Skill...

23
Experimental
73 Ghost-Frame/engram

Description: Persistent memory system for AI agents. store, search, and...

23
Experimental
74 raya-ac/engram

cognitive memory system with hybrid retrieval and neural visualization

23
Experimental
75 aggarwalkartik/rekall

A second brain that builds itself. Your AI tools remember everything you've...

23
Experimental
76 CallmeShini/nextjs-docs-rag

Open-source agentic RAG for the official Next.js documentation, with...

23
Experimental
77 KerberosClaw/kc_rag_lab

Local-first RAG pipeline, hand-built from scratch — no LangChain, no paid...

23
Experimental
78 nahidprince7/ollama-notes-laravel-package

Laravel AI Notes — AI-powered voice & text notes with semantic vector...

23
Experimental
79 ly85206559/memory-path-engine

Structured memory for agents: weighted retrieval and replayable evidence paths

23
Experimental
80 free-revalution/BoostChain

C++ LLM Agent Framework — Build AI agents, chains, and tools with OpenAI,...

23
Experimental
81 better-with-models/TinyQuant

TinyQuant is a CPU-only vector quantization codec that compresses...

23
Experimental
82 WynexLabs/cortex

Claude Code plugin for persistent, cross-machine memory. Syncs markdown...

23
Experimental
83 salonyranjan/RoleRadar

🎯 RoleRadar moves beyond keyword matching. It uses AI Agents and MCP to...

23
Experimental
84 kementbahri/semanticlayer

An open-source extraction engine and protocol that cleans up junk HTML in...

23
Experimental
85 hyeonseo2/glossary-generator

Build and maintain bilingual terminology from multilingual documentation...

23
Experimental
86 Rookiecoder-jsjs/GraphRAG

一个支持多用户的知识图谱系统,整合 Neo4j 图数据库和 ChromaDB 向量数据库,实现文档知识管理、可视化检索和 RAG 智能问答。

23
Experimental
87 Preston2012/demiurge

Adaptive memory for AI agents. Quality-gated, refusal-first, cross-model....

23
Experimental
88 RyjoxTechnologies/Octopoda-OS

The open-source memory operating system for AI agents. Persistent memory,...

22
Experimental
89 199-biotechnologies/zerank-2-mlx

Fast MLX port of ZeroEntropy zerank-2 cross-encoder reranker. 10x faster...

22
Experimental
90 Straying-bodypad392/vemb

Embed text, images, audio, video, and PDFs from the command line with vemb,...

22
Experimental
91 Nowhitestar/openclaw-memory-palace

A local-first memory upgrade for OpenClaw with semantic retrieval, link...

22
Experimental
92 ghostmountain3/gemma_web_cli

A search-enabled local CLI assistant for Ollama Gemma models. It keeps...

22
Experimental
93 erayaydn0/obsidian-vault-search

Hybrid semantic search plugin for Obsidian. BM25 + on-device vector...

22
Experimental
94 magicpro97/copilot-session-knowledge

Turn your AI coding sessions into a searchable knowledge base. FTS5 +...

22
Experimental
95 riyonp23/Probe

CLI tool that indexes any codebase into a local vector database and answers...

22
Experimental
96 DanielCardonaRojas/codemark

A structural code bookmarking system for humans and agents

22
Experimental
97 Harry-Zhao-AU/ResumeGraph

HR skill graph tool — generates fake pdf resumes, extracts employee-skill...

22
Experimental
98 BlackRoadOS/roadview

Sovereign search engine. G(n) calculator, fact-checking, blockchain verification.

22
Experimental
99 Adityansh-Chand/enterprise-rag-knowledge-system

Modular RAG pipeline with semantic retrieval, reranking abstraction,...

22
Experimental
100 cbwinslow/nautalis

Universal AI Agent Memory & Orchestration Platform

22
Experimental
101 Yukigeshiki/document-query-engine-python

A hybrid RAG system powered by LlamaIndex, Neo4j, and pgvector - handles...

22
Experimental
102 moosenet-io/lumina-engram

Semantic memory system for autonomous agents — sqlite-vec, 1536-dim embeddings

22
Experimental
103 willfanguy/obsidian-vault-mcp

MCP server for semantic search over an Obsidian vault using LanceDB and...

22
Experimental
104 YIING99/knowmine-claude-plugin

KnowMine Claude Code Plugin — personal knowledge base with semantic search via MCP

22
Experimental
105 Rohan-Boddu/mmf

Lightweight RAG-style AI system using TF-IDF, dynamic learning, and document...

22
Experimental
106 stasinosntaveas/smart_notepad

RAG-based note system with TF-IDF, embeddings, FAISS, and LLM answers

22
Experimental
107 luannamorim/docquery

Production-ready RAG system for technical documentation with hybrid...

22
Experimental
108 GOPIKA372/endee

AI-powered codebase assistant using RAG, embeddings, and semantic search

22
Experimental
109 AI-Nikitka93/ai-second-brain-bot

Telegram second-brain bot on Cloudflare Workers with AI summaries, OCR,...

22
Experimental
110 Datasculptures/reduction-quality-bench

RQB is a command-line tool that measures how faithfully a dimensionality...

22
Experimental
111 koltyakov/quant

Local-first RAG index that watches your files and serves MCP semantic search

22
Experimental
112 kahramanfaruk/AIAgentLab

Modular RAG Q&A system for answering questions over documents with local...

22
Experimental
113 koray-kaya/hybrid-search-benchmark

Hybrid search (BM25 + dense retrieval + RRF) benchmarked on 3,153 Swiss...

22
Experimental
114 Columba1198/EgaraNet-Demo

This directory contains the Web Demo for EgaraNet, implemented as a React...

22
Experimental
115 Amylalcoholfinance405/Claude-Code-Skill-handwriting-to-latex

Convert handwritten notes and scanned PDFs into clean, Overleaf-ready LaTeX...

22
Experimental
116 nlink-jp/gem-rag

Gemini-powered RAG CLI for Markdown documents — index, search, and answer...

22
Experimental
117 codegraph-ai/CodeGraph

CodeGraph builds a semantic graph of your codebase — functions, classes,...

22
Experimental
118 Datasculptures/Latent-Language-Explorer-v2

There are ideas that exist but do not have words. Not because they are vague...

22
Experimental
119 Atum246/memoryhub

🧠 Persistent memory for AI agents. One API. Every agent remembers. Long-term...

22
Experimental
120 tpriyadata/Hands-on-large-language-models

Code, notebooks, and exercises following the book 'Hands-On Large Language...

22
Experimental
121 significant-mi454/n2-QLN

Route semantic tool requests through one MCP router to simplify AI access to...

22
Experimental
122 Anne4188/semantic-kg-ai

The system ranks semantic hyponyms using both **graph proximity** and...

22
Experimental
123 Aashishh1/ml4llm

50 ML Projects to Understand LLMs

22
Experimental
124 Eric-meiyan/picseek

Local image semantic search powered by Chinese-CLIP. Search your photos with...

22
Experimental
125 numarulunu/kontext

Persistent context for Claude Code. Your history is my edge.

22
Experimental
126 AdelElo13/neurohive

Multi-agent memory intelligence — shared knowledge, expertise tracking, and...

22
Experimental
127 Zhanassy1/enterprise-copilot

Self-hostable B2B AI workspace: semantic search & RAG chat over PDF/DOCX...

22
Experimental
128 thibaultherve/SynapseAI

🧠 AI-powered research paper platform, import, analyze, and chat with...

22
Experimental
129 Omkar-ratzar/Gnosis

A lightweight semantic search engine

22
Experimental
130 RNA4219/affect-wave

An affect-expression sidecar for LLMs that infers affect from conversation...

22
Experimental
131 infektyd/sovereign-memory

Local-first memory system for OpenClaw agent swarms. Built by the agents,...

22
Experimental
132 peteroyce/clipsearch

Multi-modal semantic search engine powered by CLIP embeddings. Index images...

22
Experimental
133 Lumi-node/hermeneutica

3D interactive Bible explorer: 31K verses mapped by semantic meaning, 549K...

22
Experimental
134 AhmedToto23/Arabic-Audio-Search-Engine

AI-powered semantic search inside Arabic audio using Whisper + FAISS + Streamlit

22
Experimental
135 antoine126/embedding-from-scratch

Entraîner un modèle d'embedding — code source du livre

22
Experimental
136 senna-lang/Codeatrium

Memory palace for AI coding agents — index sessions, recall code context in ~0.2s

22
Experimental
137 Nexgale/nexgale-nex

Experimental AI-native single-file memory objects for small filesystem-based...

22
Experimental
138 Virgil-LIBRIA/chambre

Chambre Reverberante — moteur de resonance semantique

22
Experimental
139 honlam6/hanyang-course-review-system

Data Origin Wording If you need one short sentence about the overall...

22
Experimental
140 Liamhbray/second-thoughts

An Obsidian plugin that reads your vault, understands the relationships...

22
Experimental
141 omerhananya/marvin

Semantic Video Search

22
Experimental
142 Delibread0601/askaipods

Search AI podcast quotes by topic — find what Lex Fridman, Dwarkesh Patel,...

22
Experimental
143 cumlaude-hair539/howtouseai

Learn AI with clear guides and practical workflows for beginners and...

22
Experimental
144 pashunechka/inferis-ml

Worker pool for running AI models in the browser — WebGPU/WASM...

22
Experimental
145 tryAGI/EdenAI

C# SDK for the EdenAI API -- unified AI gateway with 500+ models across...

22
Experimental
146 kingsdigitallab/eb-pre

Computing Britannica - Exploratory work

22
Experimental
147 specivo/specivo

Your team's knowledge, finally findable. Self-hosted project tracking, wiki,...

22
Experimental
148 saurabhdxt-labs/memory-mesh

Local AI continuity for Claude Code - auto-captures sessions, structures...

22
Experimental
149 khai-nguyen-dinh/narrafind

A hybrid semantic video search engine combining Gemini Vision and OpenAI...

22
Experimental
150 justi/armillary

🔭 Project observatory with AI integration — one terminal command, one...

22
Experimental
151 endless-galaxy-studios/neuroloom-sdlc-plugin

cc-sdlc process knowledge forgets nothing. Deliverables, playbooks, and...

22
Experimental
152 Aldeia-IT/anytype-rag

Semantic search on indexed data from anytype

22
Experimental
153 Evanciel/stellavault

Notes die in folders. Stellavault keeps your knowledge alive — 3D knowledge...

22
Experimental
154 openfihris/openfihris

The open index for AI agents. Discover, install, and publish agents, skills,...

22
Experimental
155 msalvatti/ai-product-assistant

AI-powered multi-tenant SaaS platform where companies manage product...

22
Experimental
156 heinko/my-emoji-search

A locale-aware emoji search experience which will mostly locally in the browser.

22
Experimental
157 braintied/watchtower

Watchtower — Open-source AI coding session intelligence. Auto-captures...

22
Experimental
158 sterben-enec/obsidian-semantic-memory

Local-first semantic memory for Obsidian: SQLite + vector search + FTS +...

22
Experimental
159 JSLEEKR/memmachine-go

Re-implementation of MemMachine in Go — AI agent memory layer with graph...

22
Experimental
160 saagpatel/AssistSupport

Local-first IT support assistant — ML intent classification, sub-25ms...

22
Experimental
161 urmzd/mnemonist

An open ecosystem for tool-agnostic AI agent memory

22
Experimental
162 rdreilly58/momo-mukashi

Fast Memory Recall for AI Systems Using Hashing

22
Experimental
163 Shaisolaris/ai-rag-system

RAG system — document chunking, OpenAI embeddings, vector store, cosine...

22
Experimental
164 xiaojiou176-open/agent-exporter

Local-first Rust CLI and archive workbench for AI agent transcripts, archive...

22
Experimental
165 meshgraph/meshgraph

Open-source organizational memory. Self-hosted. AI-powered. Your code, your...

22
Experimental
166 Javihaus/cert-langchain

CERT hallucination detection for LangChain and LangSmith — embedding...

22
Experimental
167 Raffaele86/rag-brain-mcp

Production-grade MCP server for knowledge management with semantic search,...

22
Experimental
168 Shaisolaris/ai-openai-pipeline

OpenAI pipeline — streaming chat, function calling, tool use, embeddings,...

22
Experimental
169 Columba1198/EgaraNet

Train and run inference with EgaraNet, a model that encodes illustration art...

22
Experimental
170 Tox1469/embed-search

Semantic search with pgvector and OpenAI embeddings

21
Experimental
171 anthril/cloudflare-worker-templates

Production-ready Cloudflare Worker templates: Twilio Voice Agent (OpenAI...

21
Experimental
172 itang1/codenames_cluegiver

A terminal-based Codenames solver and game runner.

21
Experimental
173 adi2355/multi-source-rag-pipeline

Multi-source RAG pipeline with hybrid vector + keyword retrieval,...

21
Experimental
174 quentinvespero/podcast_transcription

a tool to download, transcript and perform semantic/keyword searches on...

20
Experimental
175 quentinvespero/voxearch

a tool to download, transcribe and perform semantic/keyword searches on...

20
Experimental
176 iamshrisawant/sorted

Sorted (SortedPC) is a privacy-focused, local-first file organization tool...

20
Experimental
177 07Codex07/Reel2Retail

AI pipeline that detects fashion items in short videos, matches them to a...

20
Experimental
178 MyronKoch/longterm-memory-macos

PostgreSQL + pgvector semantic memory system for Claude with browser...

19
Experimental
179 apex-bridge/dedupkit

Semantic deduplication using embeddings

19
Experimental
180 cliffordnwanna/JOB_HUNTER

Enterprise-grade job matching platform using a Hybrid Matching Engine...

19
Experimental
181 AndyyyYuuu/neologisms

LLMs have words for things humans don't.

18
Experimental
182 K-ShashankChowdary/CivicConnect

An AI-powered public grievance and complaint management platform built on...

18
Experimental
183 shrashti-19/PromptPulse

PromptPulse is a backend-centric system designed to securely store, manage,...

18
Experimental
184 dev-diaries41/sereleum

Prompt analytics platform that turns real user prompts into actionable insights

18
Experimental
185 MomenMushtaha/MessageAI

WhatsApp clone for iOS with built-in AI. Summarizes conversations, extracts...

18
Experimental
186 yinanli1917-cloud/searching-apple-notes

Search Apple Notes semantically with Claude Code. BGE-M3 embeddings,...

18
Experimental
187 mrlnlms/whatsapp-interaction-analysis

End-to-end data science pipeline for WhatsApp conversation analysis —...

18
Experimental
188 stxkxs/aws-bedrock-api

A Spring Boot application for interacting with AWS Bedrock models for AI...

18
Experimental
189 199-biotechnologies/engram-2

Persistent memory for AI agents. Single Rust CLI, hybrid Gemini + FTS5 + RRF...

17
Experimental
190 pbuitragoa33/Knowledge-Base-Curator-Agent

AI-powered curation agent for dynamic university courses or subjects. Uses...

17
Experimental
191 nexy39/semantic-search-document-tool

Multi-tool for semantic search

17
Experimental
192 uw-math-ai/theorem-search-app

Theorem Search App

17
Experimental
193 YehezkielG/Locaface-AI-Microservices

this repo is AI-Microservices for locaface

17
Experimental
194 ABDELRAHMAN-ELRAYES/Vai

An AI Documents Knowledge Assistant, RAG pipeline.

16
Experimental
195 nwyrwas/neural-os

AI-powered notes app with semantic search using OpenAI embeddings, RAG...

16
Experimental
196 YOUSEF-ysfxjo/ml-nlp-guide

ML & NLP learning guide — Word2Vec, GloVe, FastText comparison on...

16
Experimental
197 innov8ideas4u-alt/sovereign-memory-mcp

Local-first MCP memory server with pgvector, hybrid retrieval, and nightly...

16
Experimental
198 Gabriele-tomai00/UniTS-SQL-RAG

Chatbot for the UniTS database: about calendars, events, rooms, courses, subjects

16
Experimental
199 HollyLight28/SynaptoClaw

Advanced Cognitive Memory for AI Agents: 7-Channel Hybrid Scoring, Knowledge...

16
Experimental
200 SachinJangirX/Intelligence-Document-Analysis-System-IDAS

Offline Document Intelligence System (IDAS) for Q&A, report generation, and...

16
Experimental
201 phanxuanquang/EmbeddingGemma.NET

The .NET library to integrate Google's EmbeddingGemma-300m model into .NET projects

16
Experimental
202 saiteja6006/AI-Invoice-System

AI-powered invoice processing system using multi-agent pipeline, RAG, OCR,...

16
Experimental
203 pulipakav1/rag_data

Semantic search over 37K+ customer review embeddings via ChromaDB + Claude....

16
Experimental
204 oliveiradniel/letmeask.web

Frontend em React para sistema de IA com RAG, permitindo gravação de áudio...

16
Experimental
205 hafsa-imtiaz/Semantic-Search-Module

A modular, GUI-based semantic search system built with Streamlit,...

16
Experimental
206 oliveiradniel/letmeask.server

API de IA com RAG que transforma áudios de salas em conhecimento pesquisável...

16
Experimental
207 mariakasimceva0305-ux/hybrid-faq-retrieval-benchmark

Стенд для сравнения поиска BM25, векторного поиска, объединения сигналов и...

15
Experimental
208 Ashukr321/ResolveX-AI

ResolveX AI is an AI-powered customer support and ticket management system...

15
Experimental
209 matuteiglesias/awesome-automation-for-knowledge-work

Patterns, case studies, capabilities, and tools for automating knowledge work.

15
Experimental
210 kevinmalana/rag-pipeline

Minimal RAG pipeline with LangChain + ChromaDB + PDF embeddings

15
Experimental
211 V-Dickerson/discord_record_search

Entirely AI-coded project intended to explore the strengths & limitations of...

15
Experimental
212 stef41/embedding-similarity

Cosine similarity and vector operations for embeddings, zero dependencies

15
Experimental
213 drew0716/ragcli

RAG-in-a-Box CLI — chat with your PDFs, Word docs, and spreadsheets from the...

15
Experimental
214 KongaraLikhith/austin-eats

AI-powered restaurant discovery platform using Flask and AlloyDB AI....

15
Experimental
215 psmlabs/mem.sh

Persistent memory for AI agents. One line to save, one line to recall.

15
Experimental
216 shashank2408/Product-Stream

Real-time product search pipeline — Kafka, OpenSearch, semantic vector...

15
Experimental
217 tirthas970-cmyk/Ml-verifier-app-streamlit

High-Precision Topic Verifier Bot

15
Experimental
218 stef41/embedding-store

In-memory vector store with cosine similarity search

15
Experimental
219 Jackylwl/nlp-emotion-classifier

Text emotion classification project with classical ML and transformer models.

15
Experimental
220 Mohit1234-gif/contextual-vault

Local semantic search for your files — powered by Ollama. No API keys. No...

15
Experimental
221 afaizalam2003/aintropy-retrieval-layer

Sub-second retrieval middleware with semantic caching and cross-encoder...

15
Experimental
222 pratim4dasude/PDReader

Turn PDFs into searchable, conversational knowledge using AI.

15
Experimental
223 powerfist01/retrieval-augmented-generation

A progressive RAG project - from basic vector search to agentic retrieval,...

15
Experimental
224 azizhalloul/local-rag

Local RAG Engine from scratch using FAISS & Gemma 3 & Ollama

15
Experimental
225 Dhyey2294/westside-ai-chatbot

AI-powered shopping chatbot with product search, conversational refinement,...

15
Experimental
226 muneeb-amir/semantic-product-search-ranking

Deep learning-based semantic product search and ranking system using SBERT...

15
Experimental
227 jsilvanus/embedeer

A Node.js model tool, which supports embedding with batched input, parallel...

14
Experimental
228 oyoai/personal-knowledge-system

A prototype system for transforming raw thought streams into a structured,...

14
Experimental
229 YOUSEF-ysfxjo/coffee-flavor-map

Coffee flavor semantic map — comparing Word2Vec, GloVe, FastText on the same...

14
Experimental
230 montanaflynn/image-search

Semantic image search TUI powered by Antfly and Gemini multimodal embeddings

14
Experimental
231 GabrielCastro1221/sentinel_track_systems

Sentinel Track Systems es una plataforma integral que combina IoT (Internet...

14
Experimental
232 sheeryn123/docquery-rag

Enterprise document Q&A with RAG, citations, and workspaces

14
Experimental
233 pranavathiyani/EmbedAMR

Exploring the AMR protein embedding landscape with ESM2 and Ankh....

14
Experimental
234 didulabhanuka/semantic-search

Full-stack semantic document search engine — upload PDFs and search them...

14
Experimental
235 Zahidmohd/O2C-Insight-Engine

AI-powered NL query engine — ask questions in plain English, get SQL + graph...

14
Experimental
236 Chanakya1305/rag-document-qa

RAG-based document intelligence platform with Claude AI for accurate Q&A,...

14
Experimental
237 Andre151989/obsidian-lilbee

Enhance Obsidian search by indexing deleted files to keep them visible and...

14
Experimental
238 Bitwarelabscom/bwmem

Memory SDK for AI chatbots — facts, semantic search, emotional capture,...

14
Experimental
239 nyasalohiya/rag-fastapi

FastAPI RAG document question answering system using embeddings and semantic search

14
Experimental
240 Jainil570/Batman-Ai---Exam-Assistant

Why do we fall? So that we can learn to pick our grades back up.

14
Experimental
241 Tahrim19/AI-Projects

A curated list of my AI/ML projects — FYP, computer vision, NLP, and more.

14
Experimental
242 BihanBanerjee/holdmind

Belief-centric memory AI — chat, extract beliefs, build a knowledge graph...

14
Experimental
243 viniciustakedi/second-brain

Second Brain (Docker Edition): self-hosted Obsidian workspace with Ollama...

14
Experimental
244 Vishal-hub/Lumina

Offline first AI-powered desktop gallery 🌌

14
Experimental
245 dinhanhx/vLLM-emb-rer

deploy embeddings models and reranker models with vLLM

14
Experimental
246 KesavaAI/rag-azure-nasa

Advanced RAG-based QA system using Azure OpenAI & AI Search to answer...

14
Experimental
247 KrishChordiya/research-rag

A high-performance, full-stack RAG engine featuring image support, FlashRank...

14
Experimental
248 AnnaGals-10/second-brain

Build searchable knowledge graphs from Claude conversations with semantic...

14
Experimental
249 rebeizantoine/documind-backend

Backend for DocuMind — AI-powered document Q&A using RAG, embeddings, and...

14
Experimental
250 Anshu-raj-co/CINEMALYZE-semantic-sentiment-analysis

End-to-end NLP system using TF-IDF, cosine similarity, and dual-model...

14
Experimental
251 AndrewGlukhoff/ai-rag

Ассистент на базе paraphrase-multilingual-MiniLM-L12-v2, отвечает на вопросы...

14
Experimental
252 Manas-DE-Archieve/archivdin-frontend

🎨 Frontend for Archive Voice - a modern React + Vite interface with ⚡ fast...

14
Experimental
253 G-SEOFramework/g-seo-framework

G-SEO (Generative Search Optimization) is a structured framework and scoring...

14
Experimental
254 pouryahoseini/Multimodal-Video-Search

An end-to-end, two-stage multimodal AI pipeline for zero-shot semantic video...

14
Experimental
255 KaigorodovTuskul/easyrag

Local-first easy one-click DOCX RAG with exact/BM25/hybrid retrieval,...

14
Experimental
256 Tathagata642/LangChain_Models

LangChain learning projects covering LLMs, Chat Models, and Embeddings using...

14
Experimental
257 AIDataNordic/Food-Recipe-MCP

A production-grade semantic search server for food recipes — built for AI...

14
Experimental
258 roberteisenberg/mcp-knowledge-graph

Clinical intelligence tool built across 6 phases — demonstrates reducing LLM...

14
Experimental
259 Manas-DE-Archieve/archivdin-backend

🚀 Backend for Archive Voice - a FastAPI-powered service for managing...

14
Experimental
260 rebeizantoine/documind

AI-powered document Q&A app using RAG, semantic search, and grounded responses.

14
Experimental
261 getreka/reka-plugin

Claude Code plugin — RAG-powered AI development with semantic search,...

14
Experimental
262 Cash-Codes/AI_fishing_copilot

A hybrid RAG + recommendation system using embeddings, FAISS and Vertex AI...

14
Experimental
263 TalKleinBgu/Zap

Product deduplication pipeline for Israeli price-comparison — Hebrew/English...

14
Experimental
264 NasimReja077/raven-ai

RavenAI - A personal knowledge vault that hoards, connects, and resurfaces...

14
Experimental
265 mrcz8/ragbox

Local RAG API service powered by Ollama and pgvector. Drop it into your...

14
Experimental
266 abubakkersiddiqq/deep-reader

A semantic document Q&A API upload a PDF and ask questions about it. The...

14
Experimental
267 Dantzzz/scotus_rag

RAG Pipeline to query landmark SCOTUS cases

14
Experimental
268 codysnider/FalseMemBench

Adversarial benchmark for memory retrieval systems, with noisy distractors,...

14
Experimental
269 monkey1901010101/ai-order-status-instagram-bot

AI Instagram Chatbot 2026 - Free Customer Support Automation 🤖

14
Experimental
270 ujjwalutkarsh21/RAG-Techniques.

Ongoing RAG learning code base which includes Advance RAG techniques and...

14
Experimental
271 Nurexcoder/zrift

An AI-powered ecommerce search bot that understands natural language queries.

14
Experimental
272 eugeniaeltsova/Sillage

AI-powered Assistant to Find Your Best Perfume

14
Experimental
273 mx6315909/xiaodi-obsidian-brain-pro

拒绝 AI 爹味!这款 Obsidian 插件只记你的原话,不装逼。

14
Experimental
274 LuizDML/finlab

Laboratório financeiro - Usa RAG e/ou Agents para recomendar compra, venda...

14
Experimental
275 Scenograph/state-of-ai

Track agentic development with Claude Code, learn project setup, and manage...

14
Experimental
276 seffhunnn/rag-pdf-chatbot

An AI-powered document assistant that lets you "chat" with your PDFs using...

14
Experimental
277 cirobdomingos-cyber/support-ticket-intelligence

Complete end to end support ticket intelligence

14
Experimental
278 Anshuljain-bit/HireLens

AI hiring assistant for resume embeddings, semantic job matching, candidate...

14
Experimental
279 JoanixX/candidate_ranking_platform

Plataforma full-stack que analiza perfiles y realizar recomendaciones para...

14
Experimental
280 fntune/ragrep

Hybrid FAISS + BM25 RAG pipeline — multi-source ingestion, incremental...

14
Experimental
281 jonas-tfo/simvek

Semantic similarity search for amino acid sequences by embedding a given...

14
Experimental
282 HungPhamNoob/Embed-KCPD

Unsupervised Text Segmentation via Kernel Change-Point Detection on Sentence...

14
Experimental
283 selentium/embedserve

High-performance embedding inference service with dynamic batching, FastAPI,...

14
Experimental
284 ayushcodes13/tendermatch

A multi-source AI pipeline that scrapes, filters, and matches live tenders...

14
Experimental
285 Raynan00/LifeGraph

Turn your icloud photos and bank statements into interactive visual maps of...

14
Experimental
286 kuwacom/infinity-on-legacy-gpu

infinityを少し前の世代のGPUで動作さるためのリポジトリ | Repository to run Infinity on slightly...

14
Experimental
287 Mentorzx/MCP-register

Python MCP server for user registration and semantic search with SQLite,...

14
Experimental
288 virbahu/supply-chain-knowledge-graph

Enterprise supply chain knowledge graph for multi-tier supplier...

13
Experimental
289 TheKangChen/classes-discovery

NYPL Techconnect classes hybrid search (Semantic search + lexical search) demo page.

13
Experimental
290 tasnime-bbker/DeepMinds

Context-Aware FinCommerce Engine for Smart Discovery & Recommendations | ...

13
Experimental
291 cskwork/LEANN-RAG-QUICKSTART

Quick-start template for RAG pipeline with LangChain and embeddings

13
Experimental
292 Faraz6180/pdf-to-audiobook-ai

AI system that converts PDFs into audiobooks, summaries, podcast-style...

13
Experimental
293 omaralaswad/medical-ai-multimodal-system

End-to-end multimodal medical AI pipeline — VLM benchmarking, FAISS semantic...

13
Experimental
294 avocatt/kvkk-rag-experiments

RAG experiments for Turkish KVKK (data protection) documents

11
Experimental