pedrokohler/github-repo-to-single-file

TypeScript CLI that pulls a GitHub repo and merges all text-like files into one clean .txt or .pdf. Skips binaries, streams progress, writes to /out. Ideal for LLMs and RAG: feed an entire codebase as a single artifact so models can use the repository context seamlessly.

24
/ 100
Experimental

This tool helps developers consolidate all the text-based files from a GitHub repository or local codebase into a single, clean text or PDF document. It takes a repository URL or local directory path as input and produces a merged file in the 'out/' directory. Software engineers or AI/ML practitioners preparing codebases for analysis or large language models would find this useful.

Use this if you need to create a unified view of an entire codebase for documentation, review, or as input for AI models like those used in RAG (Retrieval Augmented Generation).

Not ideal if you need to preserve the original file structure, only care about a few specific files, or are working with non-text files like images or compiled binaries.

codebase-analysis software-documentation AI-model-training developer-tools text-extraction
No License No Package No Dependents
Maintenance 6 / 25
Adoption 5 / 25
Maturity 7 / 25
Community 6 / 25

How are scores calculated?

Stars

12

Forks

1

Language

TypeScript

License

Last pushed

Dec 12, 2025

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/rag/pedrokohler/github-repo-to-single-file"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.