All Perception Tools

10,540 tools ranked by quality score · Page 4 of 106

Showing 301–400 of 10,540
# Tool Score Tier
301 AlexMathew/scrapple

A framework for creating semi-automatic web content extractors

51
Established
302 josephlimtech/linkedin-profile-scraper-api

🕵️‍♂️ LinkedIn profile scraper returning structured profile data in JSON.

51
Established
303 zyachel/libremdb

A free & open source IMDb front-end.

51
Established
304 alechilczenko/kimi

Attack Surface Discovery tool built on a microservice approach, utilizing...

51
Established
305 ssssssss-team/spider-flow

新一代爬虫平台,以图形化方式定义爬虫流程,不写代码即可完成爬虫。

51
Established
306 yuchiu/Netflix-Clone

Netflix like full-stack application with SPA client and backend implemented...

51
Established
307 scrapfly/Antibot-Detector

Real-time detection of anti-bot systems, CAPTCHAs & fingerprinting...

51
Established
308 schasins/helena

A Chrome extension for writing custom web scraping programs and web...

51
Established
309 covidatlas/coronadatascraper

COVID-19 Coronavirus data scraped from government and curated data sources.

51
Established
310 NikolaiT/GoogleScraper

A Python module to scrape several search engines (like Google, Yandex, Bing,...

51
Established
311 codingforentrepreneurs/30-Days-of-Python

Learn Python for the next 30 (or so) Days.

51
Established
312 damklis/DataEngineeringProject

Example end to end data engineering project.

51
Established
313 gildas-lormeau/single-file-cli

CLI tool for saving a faithful copy of a complete web page in a single HTML...

51
Established
314 tholian-network/stealth

:rocket: Stealth - Secure, Peer-to-Peer, Private and Automateable Web...

51
Established
315 adithya-s-k/omniparse

Ingest, parse, and optimize any data format ➡️ from documents to multimedia...

51
Established
316 MorvanZhou/easy-scraping-tutorial

Simple but useful Python web scraping tutorial code.

51
Established
317 s0rg/crawley

The unix-way web crawler

51
Established
318 Ryuk-me/Torrent-Api-py

An Unofficial API for 1337x, Piratebay, Nyaasi, Torlock, Torrent Galaxy,...

51
Established
319 austinoboyle/scrape-linkedin-selenium

`scrape_linkedin` is a python package that allows you to scrape personal...

51
Established
320 MBach/LeMondeRssReader

:newspaper: Read RSS feed from LeMonde.fr and display news inside the App

50
Established
321 apify/actor-whitepaper

This whitepaper describes a new concept for building serverless microapps...

50
Established
322 techwizrd/MangaFox-Download-Script

A script that lets you download mangas from mangafox.com

50
Established
323 fluquid/extract-social-media

Extract social media links and account names from websites.

50
Established
324 chazeon/python-vistopia

看理想 Python 客户端 / 下载器,下载看理想的音频和文稿

50
Established
325 EchterAlsFake/youporn_api

Unofficial Python client for youporn.com: search, metadata extraction, and...

50
Established
326 sqdshguy/wreq-js

HTTP client for Node.js with browser TLS fingerprint impersonation

50
Established
327 kameleo-io/local-api-client-python

Official Python library for interacting with Kameleo Client

50
Established
328 MontFerret/cli

Ferret CLI

50
Established
329 EchterAlsFake/xfreehd_api

Unofficial Python client for xfreehd.com: search, metadata extraction, and...

50
Established
330 FramedStone/SassyNic

Multiple chrome extensions that speed up the tasks within MMU CliC.

50
Established
331 nuzulul/telegram-scraper

A simple Telegram channel scraper

50
Established
332 Evil0ctal/Douyin_TikTok_Download_API

🚀「Douyin_TikTok_Download_API」是一个开箱即用的高性能异步抖音、快手、TikTok、Bilibili数据爬取工具,支持API调用...

50
Established
333 erictleung/pixarfilms

:movie_camera: R data package to explore Pixar films, the people, and reception data

50
Established
334 antiops/tiktok-trending-data

Scraping the TikTok discovery web API every 15 minutes using Github Actions...

50
Established
335 acoyfellow/unsurf

Turn any website into a typed API

50
Established
336 iawia002/Lulu

[Unmaintained] A simple and clean video/music/image downloader 👾

50
Established
337 david-dick/firefox-marionette

This is a client module to automate the Mozilla Firefox browser via the...

50
Established
338 Gradleless/nuxt-data-parser

Parse and traverse the SSR payload NUXT_DATA - Useful for scraping Nuxt.js websites

50
Established
339 joedockrill/jmd_imagescraper

Image scraping library for creating deep learning datasets

50
Established
340 MontFerret/worker

Containerized Ferret worker

50
Established
341 andrelmlins/freesoccer

:soccer: Free API with results from national soccer competitions

50
Established
342 html2rss/html2rss

📰 Build RSS 2.0 feeds from websites (and JSON APIs) automatically or with a...

50
Established
343 davidteather/everything-web-scraping

Learn everything web scraping with David Teather Codes on YouTube

50
Established
344 danieldotnl/ha-multiscrape

Home Assistant custom component for scraping (html, xml or json) multiple...

50
Established
345 yusuzech/r-web-scraping-cheat-sheet

Guide, reference and cheatsheet on web scraping using rvest, httr and Rselenium.

50
Established
346 yubunus/Uber-Eats-Account-Generator

Generate Uber Eats accounts using IMAP or Hotmail emails using mobile packets

50
Established
347 mirusu400/Pinterest-infinite-crawler

An infinite Pinterest crawler/scraper. Crawl image with inifnite-scroll!

50
Established
348 ulixee/secret-agent

The web scraper that's nearly impossible to block - now called @ulixee/hero

50
Established
349 Decodo/Decodo

HTTP(S)/SOCKS5 rotating residential proxies - code examples & general information.

50
Established
350 y13sint/FreeQwenApi

Локальный API-прокси для Qwen AI с поддержкой сохранения контекста диалогов...

50
Established
351 Vinyzu/cursory

Generate 100% human-realistic Mouse Trajectories with Timings.

50
Established
352 buyukakyuz/email-sleuth

Discover and verify professional emails using names + domains

49
Emerging
353 da2vin/Sasila

一个灵活、友好的爬虫框架

49
Emerging
354 amoudgl/short-jokes-dataset

Python scripts for building 'Short Jokes' dataset, featured on Kaggle

49
Emerging
355 my8100/scrapyd-cluster-on-heroku

Set up free and scalable Scrapyd cluster for distributed web-crawling with...

49
Emerging
356 d-Rickyy-b/pastepwn

Python framework to scrape Pastebin pastes and analyze them

49
Emerging
357 VolkanSah/Auto-Proxy-Fetcher

Automatically fetch and update proxy lists from multiple sources every 6...

49
Emerging
358 supadata-ai/py

Official Python SDK for the Supadata API.

49
Emerging
359 elliotxx/zhihu-crawler-people

A simple distributed crawler for zhihu && data analysis

49
Emerging
360 ZenRows/zenrows-python-sdk

SDK to access ZenRows API directly from Python. We handle proxies rotation,...

49
Emerging
361 shurco/goClone

🌱 goClone - clone websites in seconds

49
Emerging
362 miroshnikov/scrapyteer

Web crawling & scraping framework for Node.js on top of headless Chrome browser

49
Emerging
363 MarketingPipeline/Python-Selenium-Action

Run Selenium with Python via Github Actions using Headless or Non-Headless browsers!

49
Emerging
364 godkingjay/selenium-twitter-scraper

This is a Twitter Scraper which uses Selenium for scraping tweets. It is...

49
Emerging
365 codingforentrepreneurs/Web-Scraping

Learn how to leverage Python's amazing tools to scrape data from other...

49
Emerging
366 CloakHQ/CloakBrowser-Manager

Web-based browser profile manager for CloakBrowser — create, launch, and...

49
Emerging
367 MLArtist/WebScraper

Python-based web crawling script with randomized intervals, user-agent...

49
Emerging
368 Hecate2/Ignareo-ISML-auto-voter

Ignareo the Carillon, a web crawler/spider template of ultimate high...

49
Emerging
369 INNOVINATI/microwler

A micro-framework for asynchronous deep crawls and web scraping with Python

49
Emerging
370 privacy-tech-lab/gpc-web-crawler

GPC Web Crawler for detecting websites' compliance with GPC privacy...

49
Emerging
371 N4rr34n6/TikTok-User-Info-Scraper

TikTok User Info Scraper allows you to fetch detailed information about...

49
Emerging
372 alyakhtar/Katastrophe

Command Line Tool to download torrents

49
Emerging
373 kameleo-io/kameleo

Anti-detect browser for web scraping and automation. Engine-level...

49
Emerging
374 ppizarror/PyMultiDictionary

PyMultiDictionary is a dictionary module that gets meanings, translations,...

49
Emerging
375 OSINT-TECHNOLOGIES/dpulse

DPULSE - Tool for complex approach to domain OSINT

49
Emerging
376 apify/apify-zapier-integration

Apify integration for Zapier

49
Emerging
377 AgriciDaniel/google-ai-studio-n8n-google-maps-scraper

A no-code Google Maps lead scraper built with Google AI Studio (Gemini) and...

49
Emerging
378 rivermont/spidy

The simple, easy to use command line web crawler.

49
Emerging
379 rayobyte-data/rayobrowse

Stealth Chromium browser for large-scale web scraping.

49
Emerging
380 macloo/python-adv-web-apps

Updated python-beginners docs and examples

49
Emerging
381 crawlab-team/crawlab-lite

Lite version of Crawlab. 轻量版 Crawlab 爬虫管理平台

49
Emerging
382 EchterAlsFake/porntrex_api

Unofficial Python client for porntrex.com: search, metadata extraction, and...

49
Emerging
383 batuhaniskr/twitter-intelligence

Twitter Intelligence OSINT project performs tracking and analysis of the Twitter

49
Emerging
384 NLPatVCU/PaperScraper

A web scraping tool to systematically extract the text of scientific papers...

49
Emerging
385 vital987/chrome-novnc

Chromium via noVNC (Browser in Browser)

49
Emerging
386 rflechner/ScrapySharp

reborn of https://bitbucket.org/rflechner/scrapysharp

49
Emerging
387 infinilabs/crawler

🕷️ An easy-to-use spider written in Golang. (previous named GOPA.)

49
Emerging
388 brandon93s/html-table-to-json

Generate JSON representations of HTML tables

49
Emerging
389 pim97/scrappey.js

Scrappey.js: A versatile JavaScript wrapper for Scrappey API for solving...

49
Emerging
390 kpliuta/termux-web-scraper

A robust and flexible web scraping framework designed to run on Android...

49
Emerging
391 abo123456789/leek

Distributed task redisqueue(最简单python分布式函数调度框架)

49
Emerging
392 Dymerz/RSI-Scraper

Web Scaper for RSI

49
Emerging
393 csu/quora-api

An unofficial API for Quora.

49
Emerging
394 mhwgoo/cambridge

Terminal version of Cambridge Dictionary by default. Also supports the...

49
Emerging
395 FaustoS88/PinescriptV6-docs-crawler

A Python tool for crawling and processing TradingView's PineScript V6...

49
Emerging
396 DataCrawl-AI/datacrawl

A simple and easy to use web crawler for Python

49
Emerging
397 umbrellaDocumentation/Web-Data-Scraper

Web Data Scraper - no-code internet scraping. Extract and export to CSV,...

49
Emerging
398 JonasCz/save-for-offline

Android app for saving webpages for offline reading.

48
Emerging
399 Raccoon254/Aviator-Automated-Betika-Bot

Automated Aviator Betting Bot for Betika, Spribe & Other Aviator-style sites...

48
Emerging
400 p-ranav/saveddit

Bulk Downloader for Reddit

48
Emerging