SauravPattnaikCS60/Weighted-Class-Tfidf

Weighted Class TFIDF technique to deal with imbalanced datasets

36
/ 100
Emerging

When analyzing text data with categories (like sentiment or topic), and some categories have much less data than others, traditional text analysis can miss important keywords from the smaller categories. This tool takes your text data and its categories, and outputs a refined list of important keywords that better represent all categories, even the small ones. This is for data analysts or researchers working with text classification problems.

No commits in the last 6 months. Available on PyPI.

Use this if you are performing text classification on imbalanced datasets where some categories have significantly fewer examples than others, and you want to ensure relevant keywords from minority classes are included.

Not ideal if your text datasets are well-balanced across all categories, or if you are not performing category-based text analysis.

text-classification natural-language-processing imbalanced-data feature-selection text-analytics
Stale 6m No Dependents
Maintenance 0 / 25
Adoption 5 / 25
Maturity 25 / 25
Community 6 / 25

How are scores calculated?

Stars

14

Forks

1

Language

Python

License

MIT

Last pushed

Nov 12, 2022

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/SauravPattnaikCS60/Weighted-Class-Tfidf"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.