SrikantShubam/WebSite-Classifier

WebSite-Classifier is a machine learning project that classifies websites based on their content. The repository includes code for web scraping, data cleaning, feature extraction, and model building using scikit-learn and pandas. In addition, the project includes a Flask module for deploying the model.

34
/ 100
Emerging

This project helps you automatically sort websites into predefined categories by analyzing their content. You input a website URL, and it outputs the classification (e.g., 'news', 'e-commerce', 'blog'). This is useful for anyone who needs to organize or analyze many websites, such as researchers, content strategists, or marketing professionals.

No commits in the last 6 months.

Use this if you need to quickly categorize a large number of websites based on what kind of information they contain.

Not ideal if you need to classify websites into highly specific, niche categories that would require specialized training data.

website-categorization content-analysis web-research information-organization
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 5 / 25
Maturity 16 / 25
Community 13 / 25

How are scores calculated?

Stars

10

Forks

2

Language

CSS

License

MIT

Last pushed

Feb 29, 2024

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/SrikantShubam/WebSite-Classifier"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.