samber/the-great-gpt-firewall

🤖 A curated list of websites that restrict access to AI Agents, AI crawlers and GPTs

45
/ 100
Emerging

This project provides a monthly-updated list of websites that use their 'robots.txt' file to prevent AI agents and crawlers, like those used by ChatGPT or Google's Bard, from accessing their content. It details which major media, video, and music sites block AI access. Website owners, content creators, and digital rights managers can use this to understand how various platforms protect their content from AI scraping.

Use this if you are a website owner, content creator, or digital rights professional looking to understand which major platforms restrict AI access and how to implement similar protections for your own site.

Not ideal if you are looking for an active tool to automatically block AI bots on your own website, as this is primarily an informational resource.

website-management content-protection digital-rights web-publishing AI-ethics
No Package No Dependents
Maintenance 10 / 25
Adoption 9 / 25
Maturity 16 / 25
Community 10 / 25

How are scores calculated?

Stars

91

Forks

7

Language

Python

License

MIT

Last pushed

Mar 01, 2026

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/samber/the-great-gpt-firewall"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.