fidelity/textwiser

[AAAI 2021] TextWiser: Text Featurization Library

/ 100

Established

When you need to turn raw text documents into numerical data for analysis or machine learning, this tool helps you choose and apply various text "featurization" methods. It takes your text (like customer reviews, news articles, or reports) and converts it into structured numerical representations. Data scientists, machine learning engineers, and NLP practitioners use this to prepare text for tasks like classification, clustering, or search.

Available on PyPI.

Use this if you need a flexible way to transform unstructured text into numerical features using a wide array of methods, including advanced pretrained models, and want to optimize these features for your specific analytical tasks.

Not ideal if you're looking for an off-the-shelf solution for a specific natural language processing task (e.g., sentiment analysis) without needing to customize the underlying feature extraction.

natural-language-processing text-analytics machine-learning-engineering data-science feature-engineering

Maintenance 10 / 25

Adoption 8 / 25

Maturity 25 / 25

Community 15 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

Apache-2.0

Related tools

RandolphVI/Multi-Label-Text-Classification

About Muti-Label Text Classification Based on Neural Network.

ThilinaRajapakse/pytorch-transformers-classification

Based on the Pytorch-Transformers library by HuggingFace. To be used as a starting point for...

ntumlgroup/LibMultiLabel

A library for multi-class and multi-label classification

xuyige/BERT4doc-Classification

Code and source for paper ``How to Fine-Tune BERT for Text Classification?``

allenai/scibert

A BERT model for scientific text.

Explore NLP Tools

All categories Trending NLP directory Insights