SauravPattnaikCS60/Weighted-Class-Tfidf
Weighted Class TFIDF technique to deal with imbalanced datasets
When analyzing text data with categories (like sentiment or topic), and some categories have much less data than others, traditional text analysis can miss important keywords from the smaller categories. This tool takes your text data and its categories, and outputs a refined list of important keywords that better represent all categories, even the small ones. This is for data analysts or researchers working with text classification problems.
No commits in the last 6 months. Available on PyPI.
Use this if you are performing text classification on imbalanced datasets where some categories have significantly fewer examples than others, and you want to ensure relevant keywords from minority classes are included.
Not ideal if your text datasets are well-balanced across all categories, or if you are not performing category-based text analysis.
Stars
14
Forks
1
Language
Python
License
MIT
Category
Last pushed
Nov 12, 2022
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/SauravPattnaikCS60/Weighted-Class-Tfidf"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
scikit-learn-contrib/imbalanced-learn
A Python Package to Tackle the Curse of Imbalanced Datasets in Machine Learning
ZhiningLiu1998/imbalanced-ensemble
🛠️ Class-imbalanced Ensemble Learning Toolbox. | 类别不平衡/长尾机器学习库 [NeurIPS'25]
solegalli/machine-learning-imbalanced-data
Code repository for the online course Machine Learning with Imbalanced Data
ZhiningLiu1998/awesome-imbalanced-learning
😎 Everything about class-imbalanced/long-tail learning: papers, codes, frameworks, and libraries...
getspams/spams-python
Python interface for SPAMS (SPArse Modeling Software)