ArchiveBox/abx-dl

⬇️ A simple all-in-one CLI tool to download EVERYTHING from a URL (like youtube-dl/yt-dlp, forum-dl, gallery-dl, simpler ArchiveBox). 🎭 Uses headless Chrome to get HTML, JS, CSS, images/video/audio/subtitles, PDFs, screenshots, article text, git repos, and more...

54
/ 100
Established

This tool helps anyone who needs to fully capture a webpage or online content. You provide a URL, and it downloads everything associated with that page: HTML, images, videos, PDFs, article text, and even entire websites. It's ideal for researchers, journalists, or anyone building a personal archive of web content.

102 stars. Available on PyPI.

Use this if you need to reliably download all available content from a URL, including dynamic elements and media, for archiving, research, or offline access.

Not ideal if you only need a simple screenshot or specific text from a page and prefer a lightweight, single-purpose tool.

web-archiving digital-preservation online-research content-capture OSINT
Maintenance 13 / 25
Adoption 9 / 25
Maturity 25 / 25
Community 7 / 25

How are scores calculated?

Stars

102

Forks

4

Language

Python

License

MIT

Last pushed

Mar 27, 2026

Commits (30d)

0

Dependencies

11

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/perception/ArchiveBox/abx-dl"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.