yusuzech/r-web-scraping-cheat-sheet

Guide, reference, and cheat sheet on web scraping using rvest, httr, and RSelenium.


Established

This is a comprehensive guide for anyone looking to extract data from websites using R. It details how to use `rvest`, `httr`, and `RSelenium` to turn web page content into structured data like lists or data frames. Data scientists, researchers, or analysts who work with R and need to gather information directly from web sources will find this useful.

399 stars. No commits in the last 6 months.

Use this if you are an R user and need to systematically collect data from websites, ranging from simple static pages to complex, JavaScript-heavy sites or those requiring login.

Not ideal if you prefer Python for web scraping, or if you only need a very basic one-off data pull that can be done manually.

data-collection, web-scraping, R-programming, data-acquisition, research-data

Stale 6m, No Package, No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 24 / 25


Stars: 399

Forks: 101

License: MIT

Featured in

Giving AI Agents Eyes: Browser Automation in 2026

Related tools

seleniumbase/SeleniumBase

APIs for browser automation, testing, and bypassing bot-detection.

apify/crawlee-python

Crawlee—A web scraping and browser automation library for Python to build reliable crawlers....

intoli/user-agents

A JavaScript library for generating random user agents with data that's updated daily.

apify/crawlee

Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In...

Kaliiiiiiiiii-Vinyzu/patchright

Undetected version of the Playwright testing and automation library.
