miroshnikov/scrapyteer
Web crawling & scraping framework for Node.js on top of headless Chrome browser
This tool helps you gather specific information from websites, including those with dynamic content like online stores or news sites. You provide a set of website links and rules for what data to extract (like product names, prices, or article text), and it outputs that structured data as a JSON or JSON Lines file. Anyone needing to collect large amounts of public web data for analysis, market research, or content aggregation would use this.
Available on npm.
Use this if you need to extract structured data from websites, especially those that load content using JavaScript, and want to define a clear, repeatable process for doing so.
Not ideal if you only need to scrape a few pages manually, or if you prefer a graphical user interface over defining your scraping workflow with code.
Stars
20
Forks
1
Language
TypeScript
License
MIT
Category
Last pushed
Mar 15, 2026
Commits (30d)
0
Dependencies
3
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/perception/miroshnikov/scrapyteer"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
seleniumbase/SeleniumBase
APIs for browser automation, testing, and bypassing bot-detection.
apify/crawlee-python
Crawlee—A web scraping and browser automation library for Python to build reliable crawlers....
intoli/user-agents
A JavaScript library for generating random user agents with data that's updated daily.
apify/crawlee
Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In...
Kaliiiiiiiiii-Vinyzu/patchright
Undetected version of the Playwright testing and automation library.