Ryaang/gpt-web-crawler

A web crawler for GPTs to build knowledge bases

/ 100

Emerging

This tool gathers information from websites to build a knowledge base for AI agents or to support other data analysis. You give it a starting website URL, and it outputs a JSON file containing the website's text, titles, keywords, and descriptions. It's designed for data analysts, AI developers, and anyone building AI agents who needs structured web content without complex coding.
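To make the described output concrete, here is a minimal sketch of turning one page's HTML into a JSON record with text, title, keywords, and description. It is not the project's code: `PageExtractor`, `extract_record`, the field names, and the sample HTML are all hypothetical, and only the Python standard library is used.

```python
import json
from html.parser import HTMLParser

# Illustrative only: field names and helpers below are assumptions,
# not the actual output schema or API of gpt-web-crawler.


class PageExtractor(HTMLParser):
    """Collects title, meta keywords/description, and visible text of one page."""

    SKIP = {"script", "style"}  # tags whose contents are not visible text

    def __init__(self):
        super().__init__()
        self.meta = {"title": "", "keywords": "", "description": ""}
        self.chunks = []
        self._skip_depth = 0
        self._in_title = False

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag in self.SKIP:
            self._skip_depth += 1
        elif tag == "title":
            self._in_title = True
        elif tag == "meta" and attrs.get("name") in ("keywords", "description"):
            self.meta[attrs["name"]] = attrs.get("content") or ""

    def handle_endtag(self, tag):
        if tag in self.SKIP and self._skip_depth:
            self._skip_depth -= 1
        elif tag == "title":
            self._in_title = False

    def handle_data(self, data):
        text = data.strip()
        if not text or self._skip_depth:
            return
        if self._in_title:
            self.meta["title"] += text
        else:
            self.chunks.append(text)


def extract_record(url, html):
    """Return a dict with the page URL, metadata, and space-joined body text."""
    parser = PageExtractor()
    parser.feed(html)
    parser.close()
    return {"url": url, **parser.meta, "text": " ".join(parser.chunks)}


SAMPLE_HTML = """<html><head><title>Example Docs</title>
<meta name="keywords" content="crawler, gpt">
<meta name="description" content="A sample page.">
<style>body { color: red; }</style></head>
<body><h1>Hello</h1><p>World of text.</p><script>var x = 1;</script></body></html>"""

if __name__ == "__main__":
    record = extract_record("https://example.com/", SAMPLE_HTML)
    print(json.dumps([record], indent=2, ensure_ascii=False))
```

A real crawl would also fetch pages and follow links from the starting URL; this sketch covers only the per-page extraction step.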

No commits in the last 6 months.

Use this if you need to quickly extract content from specific websites to feed into an AI agent or database, or to perform content analysis.

Not ideal if you require advanced custom crawling behaviors, very deep and complex site navigation, or extensive interaction with dynamic web elements beyond basic content extraction.

AI knowledge base creation, web content extraction, data collection, AI agent training, competitive intelligence

Stale 6m, No Package, No Dependents

Maintenance 0 / 25

Adoption 7 / 25

Maturity 16 / 25

Community 16 / 25

Language Python

License MIT

Higher-rated alternatives

jamesturk/scrapeghost

👻 Experimental library for scraping websites using OpenAI's GPT API.

Priyanshu-hawk/ChatGPT-unofficial-api-selenium

This is unofficial ChatGPT API using selenium for prompt testing and flow testing purposes

3281448091/easyChatGPT

An unofficial yet elegant interface of the ChatGPT API using browser automation that bypasses...

ryuseisan/auto-chatgpt

Automate interaction with the browser version of ChatGPT.

djb-gt/gpt-automated-web-scraper

The GPT-based Universal Web Scraper MVP is a solution that leverages GPT models and web scraping...
