SrikantShubam/WebSite-Classifier
WebSite-Classifier is a machine learning project that classifies websites based on their content. The repository includes code for web scraping, data cleaning, feature extraction, and model building using scikit-learn and pandas. In addition, the project includes a Flask module for deploying the model.
This project helps you automatically sort websites into predefined categories by analyzing their content. You input a website URL, and it outputs the classification (e.g., 'news', 'e-commerce', 'blog'). This is useful for anyone who needs to organize or analyze many websites, such as researchers, content strategists, or marketing professionals.
No commits in the last 6 months.
Use this if you need to quickly categorize a large number of websites based on what kind of information they contain.
Not ideal if you need to classify websites into highly specific, niche categories that would require specialized training data.
Stars
10
Forks
2
Language
CSS
License
MIT
Category
Last pushed
Feb 29, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/SrikantShubam/WebSite-Classifier"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
jenojp/negspacy
spaCy pipeline object for negating concepts in text
Mascerade/supervised-product-matching
⚖️ Neural network for product matching, aka classifying whether two product titles represent the...
dmlls/negate
negate_sentence(A Python module that doesn't negate sentences.)
sugatagh/E-commerce-Text-Classification
Proper categorization of e-commerce products enhances the user experience and achieves better...
MaggieMeow/neko
Product categorization using sequence-to-sequence models and others.