All Perception Tools

10,540 tools ranked by quality score · Page 3 of 106

Showing 201–300 of 10,540
# Tool Score Tier
201 commoncrawl/news-crawl

News crawling with StormCrawler - stores content as WARC

55
Established
202 programminghistorian/ph-submissions

The repository and website hosting the peer review process for new...

55
Established
203 cumbucadev/cinemaempoa

Site que agrega filmes em cartaz em algumas das diversas salas de cinema de...

55
Established
204 alpkeskin/rota

A high-performance proxy rotation engine with automated IP management and...

55
Established
205 yujiosaka/headless-chrome-crawler

Distributed crawler powered by Headless Chrome

54
Established
206 go-rod/rod

A Chrome DevTools Protocol driver for web automation and scraping.

54
Established
207 vakhov/fresh-proxy-list

Provides a list of fresh, working proxy servers (HTTP, HTTPS, SOCKS4 &...

54
Established
208 playwright-community/playwright-go

Playwright for Go a browser automation library to control Chromium, Firefox...

54
Established
209 EchterAlsFake/spankbang_api

A Python API for the porn site spankbang.com

54
Established
210 snooppr/shotstars

An advanced tool for checking GitHub repositories, with star statistics,...

54
Established
211 wael-sudo2/facebook-page-info-scraper

Free Facebook pages MetaData Scraping Library - Unlimited Calls

54
Established
212 sebastiancarlos/beachpatrol

🏝️ A CLI Tool to Automate Your Everyday Web Browser.

54
Established
213 wajik45/wajik-anime-api

REST API streaming dan download Anime subtitle Indonesia | sub Indo

54
Established
214 saifalisew1508/TelegramAdderTool

An Telegram Mass Members Adding/Scraping Tool Written In Python Using...

54
Established
215 ArchiveBox/abx-dl

⬇️ A simple all-in-one CLI tool to download EVERYTHING from a URL (like...

54
Established
216 lorey/socials

👨‍👩‍👦 Python library and CLI to turn URLs into structured social media profiles.

54
Established
217 DavidWittman/requests-random-user-agent

Configures the requests library to randomly select a desktop User-Agent

54
Established
218 dsheiko/puppetry

Web testing solution for non-developers on top of Puppeteer and Jest

54
Established
219 SoheilKhodayari/JAW

JAW: A Graph-based Security Analysis Framework for Client-side JavaScript

54
Established
220 ArchiveTeam/wget-lua

Wget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC...

54
Established
221 ivan-sincek/file-scraper

Scrape files for sensitive information, and generate an interactive HTML...

54
Established
222 pzaino/thecrowler

A Content Discovery and Development Platform. Empowering Cybersecurity, AI,...

54
Established
223 hmasdev/pyjpboatrace

pyjpboatrace :speedboat: provides you with useful tools for data analysis...

54
Established
224 Madi-S/Lead-Generation

Python script, which empowers people with no programming background to...

54
Established
225 spyoungtech/behave-webdriver

Selenium webdriver step library for use with the behave BDD testing framework

54
Established
226 ngongcm/public-api-list

A curated collection of the best free, freemium, and paid APIs for developers.

54
Established
227 microlinkhq/cloudflare-bot-directory

CloudFlare Radar verified bots directory – 500+ web crawlers and user agents as JSON.

54
Established
228 serpapi/google-search-results-java

Google Search Results JAVA API via SerpApi

54
Established
229 clemfromspace/scrapy-puppeteer

Scrapy + Puppeteer

54
Established
230 akvise/trends-checker

Python CLI for Google Trends analysis with advanced rate limiting, cookie...

53
Established
231 spyboy-productions/omnisci3nt

Omnisci3nt is an open-source web reconnaissance and intelligence tool for...

53
Established
232 bitmakerla/estela

estela, an elastic web scraping cluster 🕸

53
Established
233 Germey/AwesomeWebScraping

List of libraries, tools and APIs for web scraping and data processing.

53
Established
234 ToufiqQureshi/chuscraper

Stealth-native web & Android scraping framework powered by CDP and ADB with...

53
Established
235 ImranR98/Wealthsimpleton

A Python script that scrapes your Wealthsimple activity history and saves...

53
Established
236 OpenIsraeliSupermarkets/israeli-supermarket-scarpers

A python package with client to scrape the israeli supermarkets data

53
Established
237 jasonxtn/Argus

The Ultimate Information Gathering Toolkit

53
Established
238 ScrapeGraphAI/scrapecraft

🤖 AI-powered web scraping editor with visual workflow builder. Build, test &...

53
Established
239 minhlucvan/n8n-nodes-browserless

n8n node to interact with browserless instance

53
Established
240 lisc-tools/lisc

Literature Scanner: Automated collection & analyses of the scientific literature.

53
Established
241 ramonpaolo/api-b3

API Simples que retorna dados sobre tal ação/empresa da B3

53
Established
242 Logan1x/Python-Scripts

Collection of Various Python Script's.💻

53
Established
243 VIDA-NYU/ache

ACHE is a web crawler for domain-specific search.

53
Established
244 ryanirl/CraigslistScraper

Simple webscraper for Craigslist.

53
Established
245 34j/cached-historical-data-fetcher

Python utility for fetching any historical data using caching. Suitable for...

53
Established
246 City-Bureau/city-scrapers-cle

City Scrapers project for Cleveland

53
Established
247 tidyverse/rvest

Simple web scraping for R

53
Established
248 yuanxu-li/html-table-extractor

extract data from html table

53
Established
249 cityssm/wsib-clearance-check

A tool to programmatically scrape the clearance certificate status from the...

53
Established
250 html2rss/html2rss-configs

📇 A growing collection of html2rss feed configs. Generate configs with the...

53
Established
251 daneads/pypatent

Search for and retrieve US Patent and Trademark Office Patent Data

53
Established
252 recommend-games/board-game-scraper

Board game data scraper

52
Established
253 DeepFundAI/ai-browser

🤖 An AI-powered desktop automation agent built with Electron & Next.js -...

52
Established
254 scrapfly/python-scrapfly

Scrapfly Python SDK for headless browsers and proxy rotation

52
Established
255 HexNio/booking_scraper

A booking.com Web Scraper for Data Mining/Harvesting and Automation

52
Established
256 TheQmaks/justapk

Download any APK by package name. 6 sources, automatic fallback, Cloudflare...

52
Established
257 TiagoDanin/ScraperScript

ScraperScript is a query language for Web Scraping

52
Established
258 sbmagar13/sharesansar_datascrape

Sharesansar Nepal NEPSE daily share price data scraping with Python. Scrapes...

52
Established
259 CamouChat-Team/CamouChat

CamouChat : A stealth-first Python SDK for cross-platform messaging...

52
Established
260 platonai/PulsarRPAPro

Fully automated and hands-free, accurately extracting and understanding web...

52
Established
261 Hochfrequenz/ebdamame

Python library to scrape .docx files with "Entscheidungsbaumdiagramm" tables...

52
Established
262 daijro/hrequests

🚀 Web scraping for humans

52
Established
263 steel-dev/steel-cookbook

Examples and guides for using the Steel API

52
Established
264 Firmfox/Proxify

Proxify is an automated tool that collects and updates fresh SOCKS4, SOCKS5,...

52
Established
265 Hyper-Solutions/hyper-sdk-py

Python SDK for Bot Protection Bypass - Automate Akamai, Incapsula, Kasada,...

52
Established
266 postmodern/spidr

A versatile Ruby web spidering library that can spider a site, multiple...

52
Established
267 jaeyk/comp_thinking_social_science

Computational Thinking for Social Scientists book project

52
Established
268 hhuayuan/spiderbuf

Spiderbuf 是一个专注于 Python...

52
Established
269 0xSero/parchi

Your AI friend right in your browser

52
Established
270 FlowerEatsFish/books-com-tw-crawler

books.com.tw crawler 「博客來」資料爬蟲

51
Established
271 kelvinzer0/curl-impersonate-php

PHP wrapper for curl-impersonate — mimic real browser TLS fingerprints to...

51
Established
272 gahabeen/jsonframe-cheerio

simple multi-level scraper json input/output for Cheerio

51
Established
273 Algebra-FUN/WeReadScan

扫描“微信读书”已购图书并下载本地PDF的爬虫

51
Established
274 maxhumber/gazpacho

🥫 The simple, fast, and modern web scraping library

51
Established
275 cyterat/deepstate-map-data

DeepState Map | Occupied | GeoJSON Multipolygon | Daily update

51
Established
276 get-set-fetch/scraper

Nodejs web scraper. Contains a command line, docker container, terraform...

51
Established
277 devopsgroup-io/siteshooter

:camera: Automate full website screenshots and PDF generation with multiple...

51
Established
278 jurismarches/chopper

Chopper is a tool to extract elements from HTML by preserving ancestors and CSS rules

51
Established
279 angrykoala/wendigo

A proper monster for front-end automated testing

51
Established
280 stabldev/mangareader-api

A Python based web scraping api built with fastapi to get manga contents.

51
Established
281 deliton/idt

Image Dataset Tool (idt) is a cli tool designed to make the otherwise...

51
Established
282 tednaleid/ganda

fast cmd-line app that quickly requests millions of urls and can save/echo...

51
Established
283 milahu/aiohttp_chromium

aiohttp-like interface to chromium. based on selenium_driverless to bypass cloudflare

51
Established
284 tuhinpal/imdb-api

Serverless IMDB API powered by Cloudflare Worker

51
Established
285 sjdirect/abot

Cross Platform C# web crawler framework built for speed and flexibility....

51
Established
286 xianhu/PSpider

简单易用的Python爬虫框架,QQ交流群:597510560

51
Established
287 symfony/panther

A browser testing and web crawling library for PHP and Symfony

51
Established
288 lseffer/stock_screener

Picking stocks through various screening methods. Focus on Northern Europe.

51
Established
289 TrainingByPackt/Data-Wrangling-with-Python

Simplify your ETL processes with these hands-on data sanitation tips,...

51
Established
290 scrapehero/selectorlib

A library to read a YML file with Xpath or CSS Selectors and extract data...

51
Established
291 istresearch/scrapy-cluster

This Scrapy project uses Redis and Kafka to create a distributed on demand...

51
Established
292 je-suis-tm/web-scraping

Detailed web scraping tutorials for dummies with financial data crawlers on...

51
Established
293 0xjas0/Edu-Mail-Generator

Generate Free Edu Mail(s) within minutes

51
Established
294 spider-rs/spider-py

Spider ported to Python

51
Established
295 DBeath/feedsearch-crawler

Crawl sites for RSS, Atom, and JSON feeds.

51
Established
296 mov-cli/mov-cli

Watch everything from your terminal.

51
Established
297 vprusso/youtube_tutorials

Collection of scripts corresponding to LucidProgramming YouTube tutorials

51
Established
298 USCDataScience/sparkler

Spark-Crawler: Apache Nutch-like crawler that runs on Apache Spark.

51
Established
299 Yuvi9587/Kemono-Downloader

Kemono Downloader is a fast, powerful PyQt5 app for archiving content from a...

51
Established
300 tamimibrahim17/List-of-user-agents

List of major web + mobile browser user agent strings. +1 Bonus script to scrape :)

51
Established