Puzer/github-repo-embeddings
Self-supervised embeddings for GitHub repositories from ⭐ Stars. Includes data pipeline, training code, and frontend.
This project helps developers discover new open-source tools and compare their technical interests by analyzing how millions of developers star GitHub repositories. You input a repository or your starred list, and it outputs similar repositories or a comparison of your interests with others. It's designed for software developers, team leads, or open-source enthusiasts looking to broaden their tech stack or understand community trends.
Use this if you want to find alternative libraries, explore new tools based on your existing interests, or semantically compare your development profile with others.
Not ideal if you need to analyze private repositories or are looking for a system that uses explicit textual descriptions rather than collaborative filtering from star data.
Stars
53
Forks
2
Language
Svelte
License
CC0-1.0
Category
Last pushed
Jan 02, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/embeddings/Puzer/github-repo-embeddings"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
Azure/azure-search-vector-samples
A repository of code samples for Vector search capabilities in Azure AI Search.
curiosity-ai/catalyst
🚀 Catalyst is a C# Natural Language Processing library built for speed. Inspired by spaCy's...
supabase/embeddings-generator
GitHub Action to generate embeddings from the markdown files in your repository.
vector-ai/vectorai
Vector AI — A platform for building vector based applications. Encode, query and analyse data...
wagtail/wagtail-vector-index
Store Wagtail pages & Django models as embeddings in vector databases