StatguyUser/TextFeatureSelection

Python library for feature selection for text features. It has filter method, genetic algorithm and TextFeatureSelectionEnsemble for improving text classification models. Helps improve your machine learning models

43
/ 100
Emerging

When building text classification models, this library helps you identify the most impactful words or phrases to improve your model's accuracy and efficiency. You provide your text documents and their corresponding categories, and it outputs a list of relevant terms, either with scores indicating their importance or as an optimized subset. This is ideal for data scientists, machine learning engineers, or researchers working with text data.

No commits in the last 6 months. Available on PyPI.

Use this if you need to select the most relevant words or features from your text data to build more accurate and interpretable text classification models.

Not ideal if your primary goal is general text analysis or topic modeling rather than improving a specific classification model.

text-classification feature-engineering natural-language-processing machine-learning-optimization
Stale 6m
Maintenance 0 / 25
Adoption 8 / 25
Maturity 25 / 25
Community 10 / 25

How are scores calculated?

Stars

53

Forks

5

Language

Python

License

MIT

Last pushed

Jan 04, 2024

Commits (30d)

0

Dependencies

6

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/StatguyUser/TextFeatureSelection"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.