serpapi/nokolexbor
High-performance HTML5 parser for Ruby based on Lexbor, with support for both CSS selectors and XPath.
This is a tool for Ruby developers who need to quickly process web pages or other HTML content. It takes HTML documents as input and allows you to extract specific data using CSS selectors or XPath expressions, outputting the desired content or structured data. This is ideal for developers building web scraping tools, data extraction pipelines, or any application that involves parsing and querying HTML in Ruby.
400 stars.
Use this if you are a Ruby developer building an application that needs to parse HTML and extract data quickly, especially if you are already using or familiar with Nokogiri.
Not ideal if you are not a Ruby developer or if your primary need is complex XML parsing rather than HTML.
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/perception/serpapi/nokolexbor"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
scrapy/scrapy
Scrapy, a fast high-level web crawling & scraping framework for Python.
Altimis/Scweet
A simple and unlimited twitter scraper : scrape tweets, likes, retweets, following, followers,...
lexiforest/curl_cffi
Python binding for curl-impersonate fork via cffi. A http client that can impersonate browser...
plabayo/rama
modular service framework to move and transform network packets
scrapinghub/spidermon
Scrapy Extension for monitoring spiders execution.