octokami/darknet_forum
Scraping darknet forums has been the object of research in the field of intelligence and security informatics. In this repository we scraped three different sources of forums, and performed Sentiment Analysis, Topic Modelling, and Linguistic Analysis.
This project helps intelligence analysts and security researchers understand and monitor discussions on darknet, deep web, and surface web forums. It takes raw forum text data as input and produces insights into sentiment, popular topics, and linguistic patterns. Law enforcement agencies or cybersecurity firms can use this to detect emerging threats or radicalization.
No commits in the last 6 months.
Use this if you need to analyze textual content from online forums, especially those in less accessible parts of the internet, to identify extremist content, sentiment, or key discussion points related to potential real-world incidents.
Not ideal if you are looking for a moderation tool to automatically flag explicit or harmful content, as the tested OpenAI API showed limited effectiveness in this specific domain.
Stars
11
Forks
2
Language
Jupyter Notebook
License
CC0-1.0
Category
Last pushed
May 18, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/perception/octokami/darknet_forum"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
scrapy/scrapy
Scrapy, a fast high-level web crawling & scraping framework for Python.
Altimis/Scweet
A simple and unlimited twitter scraper : scrape tweets, likes, retweets, following, followers,...
lexiforest/curl_cffi
Python binding for curl-impersonate fork via cffi. A http client that can impersonate browser...
plabayo/rama
modular service framework to move and transform network packets
scrapinghub/spidermon
Scrapy Extension for monitoring spiders execution.