Ryaang/gpt-web-crawler

A web crawler for GPTs to build knowledge bases

39 / 100 (Emerging)

This tool gathers information from websites to build a knowledge base for AI agents or to support other data analysis. You give it a starting website URL, and it outputs a JSON file containing the website's text, titles, keywords, and descriptions. It is designed for data analysts, AI developers, and anyone building AI agents who needs structured web content without complex coding.
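As a rough illustration of how the JSON output described above might be consumed, here is a minimal sketch. The key names (`title`, `keywords`, `description`, `body`) are assumptions based on the fields listed in the description, not the crawler's confirmed schema, and `load_pages` and `to_documents` are hypothetical helpers.

```python
import json
from typing import Any

# Assumed field names; the crawler's real output keys may differ.
ASSUMED_FIELDS = ("title", "keywords", "description", "body")


def load_pages(raw_json: str) -> list[dict[str, Any]]:
    """Parse crawler output and keep only the assumed fields for each page."""
    data = json.loads(raw_json)
    # Accept either a single page object or a list of page objects.
    pages = data if isinstance(data, list) else [data]
    return [{key: page.get(key, "") for key in ASSUMED_FIELDS} for page in pages]


def to_documents(pages: list[dict[str, Any]]) -> list[str]:
    """Join the title and body of each page into one text chunk."""
    return [
        "\n\n".join(part for part in (page["title"], page["body"]) if part)
        for page in pages
    ]


# Hand-written sample standing in for a real crawl result.
sample = json.dumps(
    [{"title": "Home", "keywords": "a,b", "description": "Landing page", "body": "Welcome."}]
)
pages = load_pages(sample)
docs = to_documents(pages)
```

Each entry in `docs` could then be passed to whatever indexing or embedding step the knowledge base uses.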

No commits in the last 6 months.

Use this if you need to quickly extract content from specific websites to feed into an AI agent, database, or perform content analysis.

Not ideal if you require advanced custom crawling behaviors, very deep and complex site navigation, or extensive interaction with dynamic web elements beyond basic content extraction.

AI knowledge base creation, web content extraction, data collection, AI agent training, competitive intelligence
Stale 6m, No Package, No Dependents
Maintenance 0 / 25
Adoption 7 / 25
Maturity 16 / 25
Community 16 / 25


Stars: 25
Forks: 7
Language: Python
License: MIT
Last pushed: Sep 02, 2024
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/Ryaang/gpt-web-crawler"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
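The same request can be made from Python. The sketch below uses only the standard library and assumes the endpoint returns a JSON body. The response schema is not documented here, so the result is returned as parsed JSON without interpretation. `quality_url` and `fetch_quality` are hypothetical helper names.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality/llm-tools"


def quality_url(owner: str, repo: str) -> str:
    """Build the endpoint URL for one repository."""
    return f"{BASE_URL}/{quote(owner, safe='')}/{quote(repo, safe='')}"


def fetch_quality(owner: str, repo: str, timeout: float = 10.0):
    """Fetch the quality record; assumes the response body is JSON."""
    with urllib.request.urlopen(quality_url(owner, repo), timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


# Example call (performs a network request and counts against the daily limit):
# record = fetch_quality("Ryaang", "gpt-web-crawler")
```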