opensite-ai/domain_extractor
🔗 Lightweight Ruby library for parsing URLs and extracting domain components with accurate multi-part TLD support. Handles nested subdomains, query parameters, and URL normalization. Suited to web scraping, analytics, and URL manipulation. Built on Ruby's URI library and the public_suffix gem.
This library helps Ruby developers accurately break down website addresses (URLs) into their core parts, like subdomains, the main domain, and the top-level domain (e.g., .co.uk). It takes a URL as input and outputs its distinct components. This is ideal for developers building tools for web scraping, data analytics, or any application needing precise URL handling.
Use this if you are a Ruby developer building an application that needs to reliably parse complex URLs, identify multi-part top-level domains, and extract specific domain components for analytics or manipulation.
Not ideal if you are not a Ruby developer, or if you only need basic URL validation without detailed component extraction.
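To make the description above concrete, here is a minimal sketch of the kind of breakdown it refers to. It uses only Ruby's standard `uri` library and does not use domain_extractor or its API. The `split_url` method and the `MULTI_PART_SUFFIXES` table are hypothetical names invented for this illustration, and the four-entry table is only a stand-in for the full Public Suffix List that the public_suffix gem provides.

```ruby
require "uri"

# Hypothetical, tiny stand-in for the Public Suffix List. The real list
# (used by the public_suffix gem) has thousands of entries.
MULTI_PART_SUFFIXES = %w[co.uk org.uk com.au co.jp].freeze

# Splits a URL into subdomain, domain, TLD, and query parameters.
# Returns nil when no usable host can be found.
def split_url(input)
  # Normalize: URI only reports a host when a scheme is present.
  url = input.strip
  url = "https://#{url}" unless url.match?(%r{\A[a-z][a-z0-9+.-]*://}i)

  uri = URI.parse(url)
  host = uri.host&.downcase
  return nil if host.nil? || host.empty?

  labels = host.split(".")
  return nil if labels.length < 2

  # Prefer a two-label suffix such as "co.uk" when it is in the table.
  last_two = labels.last(2).join(".")
  tld_size = labels.length > 2 && MULTI_PART_SUFFIXES.include?(last_two) ? 2 : 1

  tld = labels.last(tld_size).join(".")
  domain = labels[-(tld_size + 1)]
  subdomain = labels[0...-(tld_size + 1)].join(".")

  {
    subdomain: subdomain.empty? ? nil : subdomain,
    domain: domain,
    tld: tld,
    root_domain: "#{domain}.#{tld}",
    host: host,
    query: URI.decode_www_form(uri.query.to_s).to_h
  }
rescue URI::InvalidURIError
  nil
end

parts = split_url("api.staging.example.co.uk/path?page=2&sort=asc")
puts parts[:subdomain]   # api.staging
puts parts[:root_domain] # example.co.uk
puts parts[:query]["page"] # 2
```

The sketch does not handle IP addresses, internationalized domain names, or suffixes missing from its small table. Those cases are the reason a library backed by the full Public Suffix List is preferable in practice.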
Stars: 7
Forks: 1
Language: Ruby
License: BSD-3-Clause
Category:
Last pushed: Mar 12, 2026
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/perception/opensite-ai/domain_extractor"
Open to everyone: 100 requests/day, no key needed. A free key raises the limit to 1,000/day.
Higher-rated alternatives
scrapy/scrapy
Scrapy, a fast high-level web crawling & scraping framework for Python.
Altimis/Scweet
A simple and unlimited Twitter scraper: scrape tweets, likes, retweets, following, followers,...
lexiforest/curl_cffi
Python binding for curl-impersonate fork via cffi. An HTTP client that can impersonate browser...
plabayo/rama
Modular service framework to move and transform network packets
scrapinghub/spidermon
Scrapy Extension for monitoring spiders execution.