StatguyUser/TextFeatureSelection

Python library for feature selection for text features. It has filter method, genetic algorithm and TextFeatureSelectionEnsemble for improving text classification models. Helps improve your machine learning models

/ 100

Emerging

When building text classification models, this library helps you identify the most impactful words or phrases to improve your model's accuracy and efficiency. You provide your text documents and their corresponding categories, and it outputs a list of relevant terms, either with scores indicating their importance or as an optimized subset. This is ideal for data scientists, machine learning engineers, or researchers working with text data.

No commits in the last 6 months. Available on PyPI.

Use this if you need to select the most relevant words or features from your text data to build more accurate and interpretable text classification models.

Not ideal if your primary goal is general text analysis or topic modeling rather than improving a specific classification model.

text-classification feature-engineering natural-language-processing machine-learning-optimization

Stale 6m

Maintenance 0 / 25

Adoption 8 / 25

Maturity 25 / 25

Community 10 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

MIT

Higher-rated alternatives

ryanjgallagher/shifterator

Interpretable data visualizations for understanding how texts differ at the word level

HLasse/TextDescriptives

A Python library for calculating a large variety of metrics from text

jboynyc/textnets

Text analysis with networks.

DemetersSon83/Quantitative-Discursive-Analysis

A tool for quantitatively measuring discursive similarity between bodies of text.

sciknoworg/tib-sid

TIB-SID: A bilingual (English/German) dataset of library catalog records with GND subject...

Explore NLP Tools

All categories Trending NLP directory Insights