All Perception Tools
10,540 tools ranked by quality score · Page 4 of 106
| # | Tool | Score | Tier |
|---|---|---|---|
| 301 |
AlexMathew/scrapple
A framework for creating semi-automatic web content extractors |
|
Established |
| 302 |
josephlimtech/linkedin-profile-scraper-api
🕵️♂️ LinkedIn profile scraper returning structured profile data in JSON. |
|
Established |
| 303 |
zyachel/libremdb
A free & open source IMDb front-end. |
|
Established |
| 304 |
alechilczenko/kimi
Attack Surface Discovery tool built on a microservice approach, utilizing... |
|
Established |
| 305 |
ssssssss-team/spider-flow
新一代爬虫平台,以图形化方式定义爬虫流程,不写代码即可完成爬虫。 |
|
Established |
| 306 |
yuchiu/Netflix-Clone
Netflix like full-stack application with SPA client and backend implemented... |
|
Established |
| 307 |
scrapfly/Antibot-Detector
Real-time detection of anti-bot systems, CAPTCHAs & fingerprinting... |
|
Established |
| 308 |
schasins/helena
A Chrome extension for writing custom web scraping programs and web... |
|
Established |
| 309 |
covidatlas/coronadatascraper
COVID-19 Coronavirus data scraped from government and curated data sources. |
|
Established |
| 310 |
NikolaiT/GoogleScraper
A Python module to scrape several search engines (like Google, Yandex, Bing,... |
|
Established |
| 311 |
codingforentrepreneurs/30-Days-of-Python
Learn Python for the next 30 (or so) Days. |
|
Established |
| 312 |
damklis/DataEngineeringProject
Example end to end data engineering project. |
|
Established |
| 313 |
gildas-lormeau/single-file-cli
CLI tool for saving a faithful copy of a complete web page in a single HTML... |
|
Established |
| 314 |
tholian-network/stealth
:rocket: Stealth - Secure, Peer-to-Peer, Private and Automateable Web... |
|
Established |
| 315 |
adithya-s-k/omniparse
Ingest, parse, and optimize any data format ➡️ from documents to multimedia... |
|
Established |
| 316 |
MorvanZhou/easy-scraping-tutorial
Simple but useful Python web scraping tutorial code. |
|
Established |
| 317 |
s0rg/crawley
The unix-way web crawler |
|
Established |
| 318 |
Ryuk-me/Torrent-Api-py
An Unofficial API for 1337x, Piratebay, Nyaasi, Torlock, Torrent Galaxy,... |
|
Established |
| 319 |
austinoboyle/scrape-linkedin-selenium
`scrape_linkedin` is a python package that allows you to scrape personal... |
|
Established |
| 320 |
MBach/LeMondeRssReader
:newspaper: Read RSS feed from LeMonde.fr and display news inside the App |
|
Established |
| 321 |
apify/actor-whitepaper
This whitepaper describes a new concept for building serverless microapps... |
|
Established |
| 322 |
techwizrd/MangaFox-Download-Script
A script that lets you download mangas from mangafox.com |
|
Established |
| 323 |
fluquid/extract-social-media
Extract social media links and account names from websites. |
|
Established |
| 324 |
chazeon/python-vistopia
看理想 Python 客户端 / 下载器,下载看理想的音频和文稿 |
|
Established |
| 325 |
EchterAlsFake/youporn_api
Unofficial Python client for youporn.com: search, metadata extraction, and... |
|
Established |
| 326 |
sqdshguy/wreq-js
HTTP client for Node.js with browser TLS fingerprint impersonation |
|
Established |
| 327 |
kameleo-io/local-api-client-python
Official Python library for interacting with Kameleo Client |
|
Established |
| 328 |
MontFerret/cli
Ferret CLI |
|
Established |
| 329 |
EchterAlsFake/xfreehd_api
Unofficial Python client for xfreehd.com: search, metadata extraction, and... |
|
Established |
| 330 |
FramedStone/SassyNic
Multiple chrome extensions that speed up the tasks within MMU CliC. |
|
Established |
| 331 |
nuzulul/telegram-scraper
A simple Telegram channel scraper |
|
Established |
| 332 |
Evil0ctal/Douyin_TikTok_Download_API
🚀「Douyin_TikTok_Download_API」是一个开箱即用的高性能异步抖音、快手、TikTok、Bilibili数据爬取工具,支持API调用... |
|
Established |
| 333 |
erictleung/pixarfilms
:movie_camera: R data package to explore Pixar films, the people, and reception data |
|
Established |
| 334 |
antiops/tiktok-trending-data
Scraping the TikTok discovery web API every 15 minutes using Github Actions... |
|
Established |
| 335 |
acoyfellow/unsurf
Turn any website into a typed API |
|
Established |
| 336 |
iawia002/Lulu
[Unmaintained] A simple and clean video/music/image downloader 👾 |
|
Established |
| 337 |
david-dick/firefox-marionette
This is a client module to automate the Mozilla Firefox browser via the... |
|
Established |
| 338 |
Gradleless/nuxt-data-parser
Parse and traverse the SSR payload NUXT_DATA - Useful for scraping Nuxt.js websites |
|
Established |
| 339 |
joedockrill/jmd_imagescraper
Image scraping library for creating deep learning datasets |
|
Established |
| 340 |
MontFerret/worker
Containerized Ferret worker |
|
Established |
| 341 |
andrelmlins/freesoccer
:soccer: Free API with results from national soccer competitions |
|
Established |
| 342 |
html2rss/html2rss
📰 Build RSS 2.0 feeds from websites (and JSON APIs) automatically or with a... |
|
Established |
| 343 |
davidteather/everything-web-scraping
Learn everything web scraping with David Teather Codes on YouTube |
|
Established |
| 344 |
danieldotnl/ha-multiscrape
Home Assistant custom component for scraping (html, xml or json) multiple... |
|
Established |
| 345 |
yusuzech/r-web-scraping-cheat-sheet
Guide, reference and cheatsheet on web scraping using rvest, httr and Rselenium. |
|
Established |
| 346 |
yubunus/Uber-Eats-Account-Generator
Generate Uber Eats accounts using IMAP or Hotmail emails using mobile packets |
|
Established |
| 347 |
mirusu400/Pinterest-infinite-crawler
An infinite Pinterest crawler/scraper. Crawl image with inifnite-scroll! |
|
Established |
| 348 |
ulixee/secret-agent
The web scraper that's nearly impossible to block - now called @ulixee/hero |
|
Established |
| 349 |
Decodo/Decodo
HTTP(S)/SOCKS5 rotating residential proxies - code examples & general information. |
|
Established |
| 350 |
y13sint/FreeQwenApi
Локальный API-прокси для Qwen AI с поддержкой сохранения контекста диалогов... |
|
Established |
| 351 |
Vinyzu/cursory
Generate 100% human-realistic Mouse Trajectories with Timings. |
|
Established |
| 352 |
buyukakyuz/email-sleuth
Discover and verify professional emails using names + domains |
|
Emerging |
| 353 |
da2vin/Sasila
一个灵活、友好的爬虫框架 |
|
Emerging |
| 354 |
amoudgl/short-jokes-dataset
Python scripts for building 'Short Jokes' dataset, featured on Kaggle |
|
Emerging |
| 355 |
my8100/scrapyd-cluster-on-heroku
Set up free and scalable Scrapyd cluster for distributed web-crawling with... |
|
Emerging |
| 356 |
d-Rickyy-b/pastepwn
Python framework to scrape Pastebin pastes and analyze them |
|
Emerging |
| 357 |
VolkanSah/Auto-Proxy-Fetcher
Automatically fetch and update proxy lists from multiple sources every 6... |
|
Emerging |
| 358 |
supadata-ai/py
Official Python SDK for the Supadata API. |
|
Emerging |
| 359 |
elliotxx/zhihu-crawler-people
A simple distributed crawler for zhihu && data analysis |
|
Emerging |
| 360 |
ZenRows/zenrows-python-sdk
SDK to access ZenRows API directly from Python. We handle proxies rotation,... |
|
Emerging |
| 361 |
shurco/goClone
🌱 goClone - clone websites in seconds |
|
Emerging |
| 362 |
miroshnikov/scrapyteer
Web crawling & scraping framework for Node.js on top of headless Chrome browser |
|
Emerging |
| 363 |
MarketingPipeline/Python-Selenium-Action
Run Selenium with Python via Github Actions using Headless or Non-Headless browsers! |
|
Emerging |
| 364 |
godkingjay/selenium-twitter-scraper
This is a Twitter Scraper which uses Selenium for scraping tweets. It is... |
|
Emerging |
| 365 |
codingforentrepreneurs/Web-Scraping
Learn how to leverage Python's amazing tools to scrape data from other... |
|
Emerging |
| 366 |
CloakHQ/CloakBrowser-Manager
Web-based browser profile manager for CloakBrowser — create, launch, and... |
|
Emerging |
| 367 |
MLArtist/WebScraper
Python-based web crawling script with randomized intervals, user-agent... |
|
Emerging |
| 368 |
Hecate2/Ignareo-ISML-auto-voter
Ignareo the Carillon, a web crawler/spider template of ultimate high... |
|
Emerging |
| 369 |
INNOVINATI/microwler
A micro-framework for asynchronous deep crawls and web scraping with Python |
|
Emerging |
| 370 |
privacy-tech-lab/gpc-web-crawler
GPC Web Crawler for detecting websites' compliance with GPC privacy... |
|
Emerging |
| 371 |
N4rr34n6/TikTok-User-Info-Scraper
TikTok User Info Scraper allows you to fetch detailed information about... |
|
Emerging |
| 372 |
alyakhtar/Katastrophe
Command Line Tool to download torrents |
|
Emerging |
| 373 |
kameleo-io/kameleo
Anti-detect browser for web scraping and automation. Engine-level... |
|
Emerging |
| 374 |
ppizarror/PyMultiDictionary
PyMultiDictionary is a dictionary module that gets meanings, translations,... |
|
Emerging |
| 375 |
OSINT-TECHNOLOGIES/dpulse
DPULSE - Tool for complex approach to domain OSINT |
|
Emerging |
| 376 |
apify/apify-zapier-integration
Apify integration for Zapier |
|
Emerging |
| 377 |
AgriciDaniel/google-ai-studio-n8n-google-maps-scraper
A no-code Google Maps lead scraper built with Google AI Studio (Gemini) and... |
|
Emerging |
| 378 |
rivermont/spidy
The simple, easy to use command line web crawler. |
|
Emerging |
| 379 |
rayobyte-data/rayobrowse
Stealth Chromium browser for large-scale web scraping. |
|
Emerging |
| 380 |
macloo/python-adv-web-apps
Updated python-beginners docs and examples |
|
Emerging |
| 381 |
crawlab-team/crawlab-lite
Lite version of Crawlab. 轻量版 Crawlab 爬虫管理平台 |
|
Emerging |
| 382 |
EchterAlsFake/porntrex_api
Unofficial Python client for porntrex.com: search, metadata extraction, and... |
|
Emerging |
| 383 |
batuhaniskr/twitter-intelligence
Twitter Intelligence OSINT project performs tracking and analysis of the Twitter |
|
Emerging |
| 384 |
NLPatVCU/PaperScraper
A web scraping tool to systematically extract the text of scientific papers... |
|
Emerging |
| 385 |
vital987/chrome-novnc
Chromium via noVNC (Browser in Browser) |
|
Emerging |
| 386 |
rflechner/ScrapySharp
reborn of https://bitbucket.org/rflechner/scrapysharp |
|
Emerging |
| 387 |
infinilabs/crawler
🕷️ An easy-to-use spider written in Golang. (previous named GOPA.) |
|
Emerging |
| 388 |
brandon93s/html-table-to-json
Generate JSON representations of HTML tables |
|
Emerging |
| 389 |
pim97/scrappey.js
Scrappey.js: A versatile JavaScript wrapper for Scrappey API for solving... |
|
Emerging |
| 390 |
kpliuta/termux-web-scraper
A robust and flexible web scraping framework designed to run on Android... |
|
Emerging |
| 391 |
abo123456789/leek
Distributed task redisqueue(最简单python分布式函数调度框架) |
|
Emerging |
| 392 |
Dymerz/RSI-Scraper
Web Scaper for RSI |
|
Emerging |
| 393 |
csu/quora-api
An unofficial API for Quora. |
|
Emerging |
| 394 |
mhwgoo/cambridge
Terminal version of Cambridge Dictionary by default. Also supports the... |
|
Emerging |
| 395 |
FaustoS88/PinescriptV6-docs-crawler
A Python tool for crawling and processing TradingView's PineScript V6... |
|
Emerging |
| 396 |
DataCrawl-AI/datacrawl
A simple and easy to use web crawler for Python |
|
Emerging |
| 397 |
umbrellaDocumentation/Web-Data-Scraper
Web Data Scraper - no-code internet scraping. Extract and export to CSV,... |
|
Emerging |
| 398 |
JonasCz/save-for-offline
Android app for saving webpages for offline reading. |
|
Emerging |
| 399 |
Raccoon254/Aviator-Automated-Betika-Bot
Automated Aviator Betting Bot for Betika, Spribe & Other Aviator-style sites... |
|
Emerging |
| 400 |
p-ranav/saveddit
Bulk Downloader for Reddit |
|
Emerging |