All Perception Tools
10,540 tools ranked by quality score · Page 3 of 106
| # | Tool | Score | Tier |
|---|---|---|---|
| 201 |
commoncrawl/news-crawl
News crawling with StormCrawler - stores content as WARC |
|
Established |
| 202 |
programminghistorian/ph-submissions
The repository and website hosting the peer review process for new... |
|
Established |
| 203 |
cumbucadev/cinemaempoa
Site que agrega filmes em cartaz em algumas das diversas salas de cinema de... |
|
Established |
| 204 |
alpkeskin/rota
A high-performance proxy rotation engine with automated IP management and... |
|
Established |
| 205 |
yujiosaka/headless-chrome-crawler
Distributed crawler powered by Headless Chrome |
|
Established |
| 206 |
go-rod/rod
A Chrome DevTools Protocol driver for web automation and scraping. |
|
Established |
| 207 |
vakhov/fresh-proxy-list
Provides a list of fresh, working proxy servers (HTTP, HTTPS, SOCKS4 &... |
|
Established |
| 208 |
playwright-community/playwright-go
Playwright for Go a browser automation library to control Chromium, Firefox... |
|
Established |
| 209 |
EchterAlsFake/spankbang_api
A Python API for the porn site spankbang.com |
|
Established |
| 210 |
snooppr/shotstars
An advanced tool for checking GitHub repositories, with star statistics,... |
|
Established |
| 211 |
wael-sudo2/facebook-page-info-scraper
Free Facebook pages MetaData Scraping Library - Unlimited Calls |
|
Established |
| 212 |
sebastiancarlos/beachpatrol
🏝️ A CLI Tool to Automate Your Everyday Web Browser. |
|
Established |
| 213 |
wajik45/wajik-anime-api
REST API streaming dan download Anime subtitle Indonesia | sub Indo |
|
Established |
| 214 |
saifalisew1508/TelegramAdderTool
An Telegram Mass Members Adding/Scraping Tool Written In Python Using... |
|
Established |
| 215 |
ArchiveBox/abx-dl
⬇️ A simple all-in-one CLI tool to download EVERYTHING from a URL (like... |
|
Established |
| 216 |
lorey/socials
👨👩👦 Python library and CLI to turn URLs into structured social media profiles. |
|
Established |
| 217 |
DavidWittman/requests-random-user-agent
Configures the requests library to randomly select a desktop User-Agent |
|
Established |
| 218 |
dsheiko/puppetry
Web testing solution for non-developers on top of Puppeteer and Jest |
|
Established |
| 219 |
SoheilKhodayari/JAW
JAW: A Graph-based Security Analysis Framework for Client-side JavaScript |
|
Established |
| 220 |
ArchiveTeam/wget-lua
Wget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC... |
|
Established |
| 221 |
ivan-sincek/file-scraper
Scrape files for sensitive information, and generate an interactive HTML... |
|
Established |
| 222 |
pzaino/thecrowler
A Content Discovery and Development Platform. Empowering Cybersecurity, AI,... |
|
Established |
| 223 |
hmasdev/pyjpboatrace
pyjpboatrace :speedboat: provides you with useful tools for data analysis... |
|
Established |
| 224 |
Madi-S/Lead-Generation
Python script, which empowers people with no programming background to... |
|
Established |
| 225 |
spyoungtech/behave-webdriver
Selenium webdriver step library for use with the behave BDD testing framework |
|
Established |
| 226 |
ngongcm/public-api-list
A curated collection of the best free, freemium, and paid APIs for developers. |
|
Established |
| 227 |
microlinkhq/cloudflare-bot-directory
CloudFlare Radar verified bots directory – 500+ web crawlers and user agents as JSON. |
|
Established |
| 228 |
serpapi/google-search-results-java
Google Search Results JAVA API via SerpApi |
|
Established |
| 229 |
clemfromspace/scrapy-puppeteer
Scrapy + Puppeteer |
|
Established |
| 230 |
akvise/trends-checker
Python CLI for Google Trends analysis with advanced rate limiting, cookie... |
|
Established |
| 231 |
spyboy-productions/omnisci3nt
Omnisci3nt is an open-source web reconnaissance and intelligence tool for... |
|
Established |
| 232 |
bitmakerla/estela
estela, an elastic web scraping cluster 🕸 |
|
Established |
| 233 |
Germey/AwesomeWebScraping
List of libraries, tools and APIs for web scraping and data processing. |
|
Established |
| 234 |
ToufiqQureshi/chuscraper
Stealth-native web & Android scraping framework powered by CDP and ADB with... |
|
Established |
| 235 |
ImranR98/Wealthsimpleton
A Python script that scrapes your Wealthsimple activity history and saves... |
|
Established |
| 236 |
OpenIsraeliSupermarkets/israeli-supermarket-scarpers
A python package with client to scrape the israeli supermarkets data |
|
Established |
| 237 |
jasonxtn/Argus
The Ultimate Information Gathering Toolkit |
|
Established |
| 238 |
ScrapeGraphAI/scrapecraft
🤖 AI-powered web scraping editor with visual workflow builder. Build, test &... |
|
Established |
| 239 |
minhlucvan/n8n-nodes-browserless
n8n node to interact with browserless instance |
|
Established |
| 240 |
lisc-tools/lisc
Literature Scanner: Automated collection & analyses of the scientific literature. |
|
Established |
| 241 |
ramonpaolo/api-b3
API Simples que retorna dados sobre tal ação/empresa da B3 |
|
Established |
| 242 |
Logan1x/Python-Scripts
Collection of Various Python Script's.💻 |
|
Established |
| 243 |
VIDA-NYU/ache
ACHE is a web crawler for domain-specific search. |
|
Established |
| 244 |
ryanirl/CraigslistScraper
Simple webscraper for Craigslist. |
|
Established |
| 245 |
34j/cached-historical-data-fetcher
Python utility for fetching any historical data using caching. Suitable for... |
|
Established |
| 246 |
City-Bureau/city-scrapers-cle
City Scrapers project for Cleveland |
|
Established |
| 247 |
tidyverse/rvest
Simple web scraping for R |
|
Established |
| 248 |
yuanxu-li/html-table-extractor
extract data from html table |
|
Established |
| 249 |
cityssm/wsib-clearance-check
A tool to programmatically scrape the clearance certificate status from the... |
|
Established |
| 250 |
html2rss/html2rss-configs
📇 A growing collection of html2rss feed configs. Generate configs with the... |
|
Established |
| 251 |
daneads/pypatent
Search for and retrieve US Patent and Trademark Office Patent Data |
|
Established |
| 252 |
recommend-games/board-game-scraper
Board game data scraper |
|
Established |
| 253 |
DeepFundAI/ai-browser
🤖 An AI-powered desktop automation agent built with Electron & Next.js -... |
|
Established |
| 254 |
scrapfly/python-scrapfly
Scrapfly Python SDK for headless browsers and proxy rotation |
|
Established |
| 255 |
HexNio/booking_scraper
A booking.com Web Scraper for Data Mining/Harvesting and Automation |
|
Established |
| 256 |
TheQmaks/justapk
Download any APK by package name. 6 sources, automatic fallback, Cloudflare... |
|
Established |
| 257 |
TiagoDanin/ScraperScript
ScraperScript is a query language for Web Scraping |
|
Established |
| 258 |
sbmagar13/sharesansar_datascrape
Sharesansar Nepal NEPSE daily share price data scraping with Python. Scrapes... |
|
Established |
| 259 |
CamouChat-Team/CamouChat
CamouChat : A stealth-first Python SDK for cross-platform messaging... |
|
Established |
| 260 |
platonai/PulsarRPAPro
Fully automated and hands-free, accurately extracting and understanding web... |
|
Established |
| 261 |
Hochfrequenz/ebdamame
Python library to scrape .docx files with "Entscheidungsbaumdiagramm" tables... |
|
Established |
| 262 |
daijro/hrequests
🚀 Web scraping for humans |
|
Established |
| 263 |
steel-dev/steel-cookbook
Examples and guides for using the Steel API |
|
Established |
| 264 |
Firmfox/Proxify
Proxify is an automated tool that collects and updates fresh SOCKS4, SOCKS5,... |
|
Established |
| 265 |
Hyper-Solutions/hyper-sdk-py
Python SDK for Bot Protection Bypass - Automate Akamai, Incapsula, Kasada,... |
|
Established |
| 266 |
postmodern/spidr
A versatile Ruby web spidering library that can spider a site, multiple... |
|
Established |
| 267 |
jaeyk/comp_thinking_social_science
Computational Thinking for Social Scientists book project |
|
Established |
| 268 |
hhuayuan/spiderbuf
Spiderbuf 是一个专注于 Python... |
|
Established |
| 269 |
0xSero/parchi
Your AI friend right in your browser |
|
Established |
| 270 |
FlowerEatsFish/books-com-tw-crawler
books.com.tw crawler 「博客來」資料爬蟲 |
|
Established |
| 271 |
kelvinzer0/curl-impersonate-php
PHP wrapper for curl-impersonate — mimic real browser TLS fingerprints to... |
|
Established |
| 272 |
gahabeen/jsonframe-cheerio
simple multi-level scraper json input/output for Cheerio |
|
Established |
| 273 |
Algebra-FUN/WeReadScan
扫描“微信读书”已购图书并下载本地PDF的爬虫 |
|
Established |
| 274 |
maxhumber/gazpacho
🥫 The simple, fast, and modern web scraping library |
|
Established |
| 275 |
cyterat/deepstate-map-data
DeepState Map | Occupied | GeoJSON Multipolygon | Daily update |
|
Established |
| 276 |
get-set-fetch/scraper
Nodejs web scraper. Contains a command line, docker container, terraform... |
|
Established |
| 277 |
devopsgroup-io/siteshooter
:camera: Automate full website screenshots and PDF generation with multiple... |
|
Established |
| 278 |
jurismarches/chopper
Chopper is a tool to extract elements from HTML by preserving ancestors and CSS rules |
|
Established |
| 279 |
angrykoala/wendigo
A proper monster for front-end automated testing |
|
Established |
| 280 |
stabldev/mangareader-api
A Python based web scraping api built with fastapi to get manga contents. |
|
Established |
| 281 |
deliton/idt
Image Dataset Tool (idt) is a cli tool designed to make the otherwise... |
|
Established |
| 282 |
tednaleid/ganda
fast cmd-line app that quickly requests millions of urls and can save/echo... |
|
Established |
| 283 |
milahu/aiohttp_chromium
aiohttp-like interface to chromium. based on selenium_driverless to bypass cloudflare |
|
Established |
| 284 |
tuhinpal/imdb-api
Serverless IMDB API powered by Cloudflare Worker |
|
Established |
| 285 |
sjdirect/abot
Cross Platform C# web crawler framework built for speed and flexibility.... |
|
Established |
| 286 |
xianhu/PSpider
简单易用的Python爬虫框架,QQ交流群:597510560 |
|
Established |
| 287 |
symfony/panther
A browser testing and web crawling library for PHP and Symfony |
|
Established |
| 288 |
lseffer/stock_screener
Picking stocks through various screening methods. Focus on Northern Europe. |
|
Established |
| 289 |
TrainingByPackt/Data-Wrangling-with-Python
Simplify your ETL processes with these hands-on data sanitation tips,... |
|
Established |
| 290 |
scrapehero/selectorlib
A library to read a YML file with Xpath or CSS Selectors and extract data... |
|
Established |
| 291 |
istresearch/scrapy-cluster
This Scrapy project uses Redis and Kafka to create a distributed on demand... |
|
Established |
| 292 |
je-suis-tm/web-scraping
Detailed web scraping tutorials for dummies with financial data crawlers on... |
|
Established |
| 293 |
0xjas0/Edu-Mail-Generator
Generate Free Edu Mail(s) within minutes |
|
Established |
| 294 |
spider-rs/spider-py
Spider ported to Python |
|
Established |
| 295 |
DBeath/feedsearch-crawler
Crawl sites for RSS, Atom, and JSON feeds. |
|
Established |
| 296 |
mov-cli/mov-cli
Watch everything from your terminal. |
|
Established |
| 297 |
vprusso/youtube_tutorials
Collection of scripts corresponding to LucidProgramming YouTube tutorials |
|
Established |
| 298 |
USCDataScience/sparkler
Spark-Crawler: Apache Nutch-like crawler that runs on Apache Spark. |
|
Established |
| 299 |
Yuvi9587/Kemono-Downloader
Kemono Downloader is a fast, powerful PyQt5 app for archiving content from a... |
|
Established |
| 300 |
tamimibrahim17/List-of-user-agents
List of major web + mobile browser user agent strings. +1 Bonus script to scrape :) |
|
Established |