StatguyUser/TextFeatureSelection
A Python library for selecting features from text. It provides a filter method, a genetic algorithm, and TextFeatureSelectionEnsemble, all aimed at improving text classification models.
When building text classification models, this library helps you identify the most impactful words or phrases to improve your model's accuracy and efficiency. You provide your text documents and their corresponding categories, and it outputs a list of relevant terms, either with scores indicating their importance or as an optimized subset. This is ideal for data scientists, machine learning engineers, or researchers working with text data.
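To make the "terms with scores" idea concrete, here is a minimal sketch of a filter-style scorer using only the Python standard library. It is not this library's API: the function name `chi_square_scores`, the whitespace tokenizer, and the toy documents are all assumptions made for illustration. It ranks each word by a 2x2 chi-square statistic of word presence against one target category.

```python
from collections import Counter


def chi_square_scores(docs, labels, positive_label):
    """Rank words by a 2x2 chi-square statistic (word presence vs. class).

    Hypothetical helper for illustration; not part of TextFeatureSelection.
    """
    n = len(docs)
    # Assumption: naive lowercase whitespace tokenization, presence only.
    tokenized = [set(doc.lower().split()) for doc in docs]
    n_pos = sum(1 for y in labels if y == positive_label)
    n_neg = n - n_pos

    present_all = Counter()  # documents containing the word
    present_pos = Counter()  # positive-class documents containing the word
    for words, y in zip(tokenized, labels):
        present_all.update(words)
        if y == positive_label:
            present_pos.update(words)

    scores = {}
    for word, total in present_all.items():
        a = present_pos[word]  # positive docs with the word
        b = total - a          # negative docs with the word
        c = n_pos - a          # positive docs without the word
        d = n_neg - b          # negative docs without the word
        denom = (a + b) * (c + d) * (a + c) * (b + d)
        # A word present in every document carries no signal: score 0.
        scores[word] = n * (a * d - b * c) ** 2 / denom if denom else 0.0
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)


# Toy data, invented for this example.
docs = [
    "cheap pills buy now",
    "buy cheap watches now",
    "meeting agenda for monday",
    "lunch meeting on monday",
]
labels = ["spam", "spam", "ham", "ham"]

ranked = chi_square_scores(docs, labels, positive_label="spam")
for word, score in ranked[:5]:
    print(f"{word}\t{score:.3f}")
```

On this toy data, words that appear in every document of exactly one category (such as "cheap" or "meeting") tie at the top score of 4.0, while words seen in only one document score lower. A real pipeline would use a proper tokenizer and far more data.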
No commits in the last 6 months. Available on PyPI.
Use this if you need to select the most relevant words or features from your text data to build more accurate and interpretable text classification models.
Not ideal if your primary goal is general text analysis or topic modeling rather than improving a specific classification model.
Stars: 53
Forks: 5
Language: Python
License: MIT
Category:
Last pushed: Jan 04, 2024
Commits (30d): 0
Dependencies: 6
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/StatguyUser/TextFeatureSelection"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
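The same request can be made from Python with the standard library. This is a sketch under assumptions: the page does not document the response format, so treating the body as JSON is a guess, and the helper names `build_request` and `fetch_quality` are hypothetical.

```python
import json
import urllib.request

# URL copied from the curl command above.
API_URL = (
    "https://pt-edge.onrender.com/api/v1/quality/nlp/"
    "StatguyUser/TextFeatureSelection"
)


def build_request(url=API_URL):
    """Build a GET request; no API key is attached (keyless tier)."""
    req = urllib.request.Request(url, method="GET")
    req.add_header("Accept", "application/json")
    return req


def fetch_quality(url=API_URL, timeout=10):
    """Send the request and parse the body.

    Assumption: the endpoint returns JSON. If it does not,
    json.loads will raise a ValueError.
    """
    with urllib.request.urlopen(build_request(url), timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


# Usage (performs a network call, so it is left commented out):
# data = fetch_quality()
# print(data)
```

Since the keyless tier is limited to 100 requests/day, caching the response locally is a sensible default for scripts that run repeatedly.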
Higher-rated alternatives
ryanjgallagher/shifterator
Interpretable data visualizations for understanding how texts differ at the word level
HLasse/TextDescriptives
A Python library for calculating a large variety of metrics from text
jboynyc/textnets
Text analysis with networks.
DemetersSon83/Quantitative-Discursive-Analysis
A tool for quantitatively measuring discursive similarity between bodies of text.
sciknoworg/tib-sid
TIB-SID: A bilingual (English/German) dataset of library catalog records with GND subject...