gidlov/copycat

A PHP Scraping Class

42
/ 100
Emerging

This is a PHP class that helps developers extract specific pieces of information from web pages, even across tens of thousands of pages. It takes URLs (or search engine queries) and regular expressions as input, then outputs structured text data and can download associated files like images. It's designed for PHP developers who need to programmatically collect data from public websites.

No commits in the last 6 months.

Use this if you are a PHP developer needing to programmatically scrape specific data points from websites, download files from those sites, or find relevant pages using a search engine.

Not ideal if you need a simpler, less code-intensive solution for web scraping or if you are working in a language other than PHP.

PHP development web scraping data extraction web data collection content parsing
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 9 / 25
Maturity 16 / 25
Community 17 / 25

How are scores calculated?

Stars

73

Forks

13

Language

PHP

License

LGPL-3.0

Category

scraper

Last pushed

Sep 03, 2017

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/perception/gidlov/copycat"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.