ntddk/peeling-onions
A repository to store Deep Web (onion domain) crawler, scraper, and NLP tools for Tor network.
This tool helps intelligence analysts and researchers explore content on the Deep Web's Tor network. It takes Deep Web addresses (onion domains) as input and outputs scraped text and analyzed information from those sites. Security researchers, law enforcement, and academic researchers investigating online illicit activities or hidden communities would use this.
No commits in the last 6 months.
Use this if you need to systematically collect and understand information from websites operating on the Tor network.
Not ideal if you are looking to crawl the surface web or require advanced anonymization beyond basic Tor network access.
Stars
23
Forks
7
Language
Perl
License
—
Category
Last pushed
Jul 15, 2016
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/ntddk/peeling-onions"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
flairNLP/fundus
A very simple news crawler with a funny name
fhamborg/news-please
news-please - an integrated web crawler and information extractor for news that just works
affjljoo3581/canrevan
대량의 네이버 뉴스 기사를 수집하는 라이브러리입니다.
FreeDiscovery/FreeDiscovery
Web Service for E-Discovery Analytics
tirthajyoti/Web-Database-Analytics
Web scrapping and related analytics using Python tools