niizam/4chan-datasets
Scrape any 4chan boards
This tool helps researchers and data analysts gather discussions from any 4chan board. You input the raw text files from specific boards, and it outputs a clean, organized dataset in JSON or CSV format. It's designed for those who need to analyze online forum content for trends, sentiment, or linguistic patterns.
No commits in the last 6 months.
Use this if you need to systematically collect and structure conversations from 4chan for research or analysis.
Not ideal if you're looking for real-time monitoring or a simple way to browse 4chan content without extensive data processing.
Stars
10
Forks
3
Language
Python
License
—
Category
Last pushed
Apr 21, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/niizam/4chan-datasets"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
jamesturk/scrapeghost
👻 Experimental library for scraping websites using OpenAI's GPT API.
Priyanshu-hawk/ChatGPT-unofficial-api-selenium
This is unofficial ChatGPT API using selenium for prompt testing and flow testing purposes
3281448091/easyChatGPT
An unofficial yet elegant interface of the ChatGPT API using browser automation that bypasses...
ryuseisan/auto-chatgpt
Automate interaction with the browser version of ChatGPT.
Ryaang/gpt-web-crawler
A web crawler for GPTs to build knowledge bases 用于GPT构建知识库的网站爬虫