themains/piedomains

Classify the kind of content hosted by a domain using the domain name and the text and a screenshot of its homepage.

46 / 100 Emerging

This tool helps you understand the primary content of websites. You provide a list of domain names, and it tells you if each site is primarily for news, shopping, education, or one of 38 other categories, along with a confidence score. This is useful for market researchers, brand managers, or anyone needing to categorize large numbers of websites for analysis.

Available on PyPI.

Use this if you need to quickly and accurately determine the content type of many websites, including historical versions, without manually visiting each one.

Not ideal if you only need to classify a handful of websites, or if you require extremely niche content categories not covered by the existing 41 options.

market-research brand-safety competitor-analysis content-auditing web-intelligence
Maintenance 6 / 25
Adoption 6 / 25
Maturity 25 / 25
Community 9 / 25
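
The four sub-scores above appear to add up to the overall 46 / 100. That the total is a plain sum is an inference from the numbers shown, not something the page states. A minimal sketch of that arithmetic:

```python
# Sub-scores copied from the listing; each is out of 25.
sub_scores = {
    "Maintenance": 6,
    "Adoption": 6,
    "Maturity": 25,
    "Community": 9,
}
MAX_PER_COMPONENT = 25

# Assumption: the overall score is the unweighted sum of the components.
total = sum(sub_scores.values())
maximum = MAX_PER_COMPONENT * len(sub_scores)

summary = f"{total} / {maximum}"
print(summary)
```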

Stars 16
Forks 2
Language Jupyter Notebook
License MIT
Last pushed Dec 20, 2025
Commits (30d) 0
Dependencies 12

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/themains/piedomains"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
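
The curl request above can also be made from Python. The following is a minimal sketch using only the standard library. The helper names `quality_url` and `fetch_quality` and the parameter name `category` (for the `ml-frameworks` path segment) are hypothetical. The sketch also assumes the endpoint returns JSON, which the listing does not confirm, so the payload is returned as-is rather than mapped to specific fields.

```python
import json
import urllib.request

# Base path taken from the curl example in the listing.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL in the same shape as the curl example."""
    return f"{BASE_URL}/{category}/{owner}/{repo}"


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the quality record for one repository.

    Assumption: the response body is JSON. The schema is not documented
    in the listing, so the parsed payload is returned unchanged.
    """
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_quality("ml-frameworks", "themains", "piedomains")` issues the same request as the curl command. With no key, the stated limit of 100 requests/day applies, so results are worth caching when looping over many repositories.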