Code Repository Intelligence Embedding Tools
Tools for indexing, analyzing, and semantically searching code repositories using embeddings and AST parsing. Includes code understanding, commit message generation, and code-aware Q&A systems. Does NOT include general code review platforms, CI/CD tools, or non-semantic code search.
This category tracks 75 code repository intelligence tools. Nine score above 50 (the Established tier). The highest-rated is dtsola/xiaoyaosearch at 67/100 with 1,207 stars. Three of the top 10 are actively maintained.
Get all 75 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=embeddings&subcategory=code-repository-intelligence&limit=20"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
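The same request can be sketched in Python with only the standard library. This is a minimal illustration, not an official client: the helper names `build_url` and `fetch_projects` are hypothetical, and the response schema is not described on this page, so the decoded JSON is returned untouched. The example URL passes `limit=20`; whether a larger value such as 75 is accepted in a single call is an assumption this page does not confirm.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Endpoint taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(domain="embeddings",
              subcategory="code-repository-intelligence",
              limit=20):
    """Return the dataset URL with the same query parameters as the curl example."""
    query = urlencode({"domain": domain, "subcategory": subcategory, "limit": limit})
    return f"{BASE_URL}?{query}"


def fetch_projects(limit=20, timeout=30):
    """Fetch and decode the JSON body.

    The response schema is an unknown here, so the decoded object is
    returned as-is rather than unpacked into assumed fields.
    """
    with urlopen(build_url(limit=limit), timeout=timeout) as response:
        return json.load(response)
```

Calling `fetch_projects()` performs one unauthenticated request, which counts against the 100 requests/day allowance mentioned above.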
| # | Tool | Score | Tier |
|---|---|---|---|
| 1 | dtsola/xiaoyaosearch<br>Xiaoyao Search understands what you say and what is in your images, and uses AI to find any local file. It makes search as simple as chatting. XiaoyaoSearch: Understands your... | | Established |
| 2 | cocoindex-io/cocoindex<br>Data transformation framework for AI. Ultra performant, with incremental... | | Established |
| 3 | justincasher/lean-explore<br>A search engine for Lean 4 declarations | | Established |
| 4 | Ryandonofrio3/osgrep<br>Open Source Semantic Search for your AI Agent | | Established |
| 5 | yoanbernabeu/grepai<br>Semantic Search & Call Graphs for AI Agents (100% Local) | | Established |
| 6 | infinilabs/coco-server<br>🥥 Coco AI Server - Search, Connect, Collaborate, AI-powered Enterprise... | | Established |
| 7 | probelabs/probe<br>AI-friendly semantic code search engine for large codebases. Combines... | | Established |
| 8 | starthackHQ/Contextinator<br>Turning messy repos into weapons of mass structured context. | | Established |
| 9 | scarletkc/vexor<br>A semantic search engine for files and code. | | Established |
| 10 | revokslab/codecrawl<br>🌊 Turn entire codebases into LLM-ready data. Extract data, search, and... | | Emerging |
| 11 | cyberytti/ToolHunt<br>This is a local search engine to search for cybersecurity tools. It has... | | Emerging |
| 12 | SylphxAI/coderag<br>Lightning-fast semantic code search with AST chunking (15+ languages) -... | | Emerging |
| 13 | auyelbekov/rawq<br>Context retrieval engine for AI agents — semantic + lexical search over codebases | | Emerging |
| 14 | XiaoConstantine/sgrep<br>CLI for semantic grep | | Emerging |
| 15 | tomohiro-owada/devrag<br>Markdown vector search MCP server for Claude Code. Natural language search... | | Emerging |
| 16 | SecrinLabs/secrin<br>The living wiki that writes itself. | | Emerging |
| 17 | billlzzz10/bl1nk-mcp-server<br>Modular, audit-ready memory system combining knowledge graph, semantic... | | Emerging |
| 18 | rekal-dev/rekal-cli<br>Git-anchored decentralised intent(conversation) ledger for teams who build with AI | | Emerging |
| 19 | definitive-io/code-indexer-loop<br>Code Indexer Loop is a Python library for indexing and retrieving source... | | Emerging |
| 20 | M9nx/CodexA<br>Codexa is a local semantic code intelligence CLI designed to help AI... | | Emerging |
| 21 | sagarmk/beacon-plugin<br>Semantic code search plugin for Claude Code using hybrid vector search +... | | Emerging |
| 22 | wjddusrb03/diffmind<br>AI Code Review Memory - learns from your team's bug history and warns when... | | Emerging |
| 23 | mvp-scale/aOa<br>Semantic compression Claude and Gemini. 5 angles of O(1) indexed search —... | | Emerging |
| 24 | pwrdrvr/ghcrawl<br>Terminal UI and local CLI for crawling GitHub issues and pull requests,... | | Emerging |
| 25 | iagocavalcante/claude-turbo-search<br>Optimized file search and semantic indexing for large codebases in Claude Code | | Emerging |
| 26 | chiraag-kakar/context-nexus<br>Context Nexus is an AI-native backend platform and SDK for managing,... | | Emerging |
| 27 | jsbattig/code-indexer<br>Python application to index code locally and support running server with... | | Emerging |
| 28 | zircote/rlm-rs-plugin<br>Claude Code plugin for processing documents 100x larger than context limits... | | Experimental |
| 29 | dvcdsys/code-index<br>Semantic code search powered by embeddings. Search your codebase by meaning,... | | Experimental |
| 30 | sltnsrh/knowledge-base<br>Semantic search knowledge base with vector embeddings and Claude MCP integration | | Experimental |
| 31 | Pomilon/Kestr<br>High-performance daemon for real-time codebase indexing. Generates semantic... | | Experimental |
| 32 | AssahBismarkabah/42context<br>A Local Context Engine | | Experimental |
| 33 | Tomcat132025/odino<br>🔍 Discover and access your code quickly with Odino, a fast local semantic... | | Experimental |
| 34 | Shun0212/Owl-CLI<br>Semantic code search using vector embeddings. Search your codebase with... | | Experimental |
| 35 | ShadReyes/cortex-recall<br>Semantic code & git history search CLI — tree-sitter parsing, pluggable... | | Experimental |
| 36 | usr-wwelsh/botdocs<br>Turn md into a pretty site with chatbot | | Experimental |
| 37 | Sharper-Flow/lgrep<br>Dual-engine code intelligence for OpenCode: semantic code search plus symbol... | | Experimental |
| 38 | solomonneas/code-search-api<br>Local semantic code search with Ollama embeddings, SQLite, and hybrid... | | Experimental |
| 39 | Aliipou/codebase_intelligence<br>AI-powered codebase intelligence: semantic search, dependency analysis, and... | | Experimental |
| 40 | evoleinik/claude-grep<br>Search Claude Code session history. Regex + semantic (vector) search. Single... | | Experimental |
| 41 | sert-xx/unified-blueprint<br>Documentation-as-Code middleware — structure Markdown docs into a Document... | | Experimental |
| 42 | damiandelmas/flex<br>Local search and retrieval for AI Agents | | Experimental |
| 43 | andycandy/CausewayAI<br>Next-gen semantic retrieval system. Combines the power of Qdrant vector... | | Experimental |
| 44 | souldriver007/karp-bible-code<br>AI-assisted ELS (Equidistant Letter Spacing) Bible code research engine for... | | Experimental |
| 45 | souldriver007/karp-word-graph<br>AI-powered KJV Bible study companion for Claude Desktop. Semantic scripture... | | Experimental |
| 46 | Stahldavid/sensegrep<br>Semantic + structural code search for AI-native development | | Experimental |
| 47 | jsuppe/loom<br>🧵 Requirements traceability for AI-assisted development. Extract... | | Experimental |
| 48 | mvp-scale/aOa-legacy<br>5 angles. 1 attack. O(1) indexed search. Up to 95% fewer tokens per... | | Experimental |
| 49 | PEACEBINFLOW/mindscript-search<br>Semantic & structural search engine for the MindScript ecosystem. Index... | | Experimental |
| 50 | PPierzc/hive<br>🐝🔍 Hive: A CLI Tool for Semantic Searching of Your Knowledge Base | | Experimental |
| 51 | parbhatkapila4/RepoDocs<br>An AI-Powered Code Documentation Platform Automated documentation engine... | | Experimental |
| 52 | josehu07/codetective<br>Takes code, gives AI authorship detection in five clicks :mag_right: | | Experimental |
| 53 | tanuj077/codeatlas<br>CodeAtlas: AI-powered code search and chat system using AST parsing,... | | Experimental |
| 54 | MohammedNasserAhmed/CodeXpert<br>CodeXpert: A cutting-edge AI-powered code analysis tool leveraging... | | Experimental |
| 55 | louisfghbvc/CppSeek<br>AI-Powered Semantic Search for C/C++ | | Experimental |
| 56 | davidteren/code_grasp<br>A CLI tool that uses the Qodo-Embed-1-1.5B embedding model to analyze code,... | | Experimental |
| 57 | moabualruz/rice-search<br>A fully local, production-ready code search platform with hybrid BM25 +... | | Experimental |
| 58 | ThinkerYzu/kb-indexer<br>LLM-powered knowledge base indexer that builds a growing semantic layer of... | | Experimental |
| 59 | nshkrdotcom/portfolio_manager<br>AI-native personal project intelligence system - manage, track, and search... | | Experimental |
| 60 | Lioness100/decimeta<br>A website to help you find the correct Dewey Decimal number for any subject using AI. | | Experimental |
| 61 | nshkrdotcom/portfolio_coder<br>Code Intelligence Platform: Repository analysis, semantic code search,... | | Experimental |
| 62 | copyleftdev/tala<br>Intent-native narrative execution layer. Reimagines Linux shell history as a... | | Experimental |
| 63 | souldriver007/karp-graph-lite<br>"Personal knowledge graph for Claude Desktop — remember, recall, connect" | | Experimental |
| 64 | gantumurbattumur/Github-aware-RAG<br>Semantic search across your starred and own GitHub repos, right inside VS... | | Experimental |
| 65 | bhavesh-kalluru/genai-project-2026-03-27<br>AI-powered CLI that analyzes git diffs and generates conventional commit... | | Experimental |
| 66 | bhavesh-kalluru/genai-project-2026-03-26<br>AI-powered Python code review tool that detects anti-patterns using... | | Experimental |
| 67 | moijafcor/glean<br>Ask plain-English questions about your projects — source code,... | | Experimental |
| 68 | NeaByteLab/AI-Indexing<br>Code indexing examples for converting source code into structured repository... | | Experimental |
| 69 | The-Focus-AI/embeddings-search-skill<br>Claude Code plugin for hybrid document search (grep + semantic embeddings) | | Experimental |
| 70 | NeaByteLab/Dev-Knowledge<br>Build searchable knowledge bases by scraping developer documentation and... | | Experimental |
| 71 | oroinc/documentation-markdown<br>Markdown variant for AI | | Experimental |
| 72 | luanvenancio/design-extractor<br>A self-hosted backend that captures websites, extracts design signals, and... | | Experimental |
| 73 | Agents365-ai/semanticscholar-skill<br>Claude Code skill for academic paper search using the Semantic Scholar API | | Experimental |
| 74 | SyntaxSidekick/historisnap<br>HistoriSnap is an interactive explorer of historical events — from ancient... | | Experimental |
| 75 | SoulDev07/CodeCompass<br>CodeCompass transforms any GitHub repo into an interactive, searchable... | | Experimental |