jonasjacek/robots.txt

Simple robots.txt template. Keep unwanted robots out (disallow). White lists (allow) legitimate user-agents. Useful for all websites.

46
/ 100
Emerging

This project offers pre-configured files to help you control which automated web programs (like search engine crawlers or data scrapers) can access your website. You provide the template to your website, and it instructs compliant bots on what they can and cannot do. Webmasters and website owners who want to manage site traffic and privacy would use this.

No commits in the last 6 months.

Not ideal if you need highly complex, dynamic, or user-specific rules for bot interaction that go beyond the Robots Exclusion Standard.

website-management SEO-control web-privacy bot-traffic-management content-access
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 9 / 25
Maturity 16 / 25
Community 21 / 25

How are scores calculated?

Stars

88

Forks

38

Language

License

Category

scraper

Last pushed

Feb 16, 2025

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/perception/jonasjacek/robots.txt"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.