revokslab/codecrawl
🌊 Turn entire codebases into LLM-ready data. Extract data, search, and llms.txt from any repo with a single API.
This tool helps AI engineers and developers prepare public codebases for use with large language models. You provide a repository URL, and it generates clean markdown, structured data, or a 'llms.txt' file, ready to be fed into your AI applications. It's designed for anyone building AI tools that need to understand or process code.
Use this if you need to quickly and cleanly extract data, analyze file structures, or semantically search public code repositories to train or empower your AI models.
Not ideal if you need to process private repositories or require deeply customized data extraction beyond the provided formats.
Stars: 79
Forks: 8
Language: TypeScript
License: AGPL-3.0
Category
Last pushed: Jan 25, 2026
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/embeddings/revokslab/codecrawl"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
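The same request can be made from a script. The following is a minimal Python sketch using only the standard library. It assumes the endpoint returns a JSON body, which the page does not confirm, and the helper names `build_url` and `fetch_quality` are hypothetical, chosen for illustration. How an API key would be passed is not documented here, so the sketch covers only the keyless tier.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality/embeddings"


def build_url(owner: str, repo: str) -> str:
    """Build the endpoint URL for an owner/repo pair (hypothetical helper)."""
    # Percent-encode each path segment so unusual characters cannot break the path.
    return f"{BASE_URL}/{quote(owner, safe='')}/{quote(repo, safe='')}"


def fetch_quality(owner: str, repo: str, timeout: float = 10.0):
    """Fetch the record for a repository.

    Assumption: the response body is JSON. Its schema is not documented
    on this page, so the parsed value is returned unchanged.
    """
    request = urllib.request.Request(
        build_url(owner, repo),
        headers={"Accept": "application/json"},
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


if __name__ == "__main__":
    # Print the URL only; no network call is made here.
    print(build_url("revokslab", "codecrawl"))
```

Calling `fetch_quality("revokslab", "codecrawl")` performs the actual request. With the keyless limit of 100 requests/day, it is worth caching results locally rather than calling it in a loop.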
Higher-rated alternatives
dtsola/xiaoyaosearch
XiaoyaoSearch: understands your words, reads your images, and uses AI to find any local file. Makes search as simple as chatting.
cocoindex-io/cocoindex
Data transformation framework for AI. Ultra performant, with incremental processing. 🌟 Star if...
justincasher/lean-explore
A search engine for Lean 4 declarations
Ryandonofrio3/osgrep
Open Source Semantic Search for your AI Agent
yoanbernabeu/grepai
Semantic Search & Call Graphs for AI Agents (100% Local)