fidelity/textwiser
[AAAI 2021] TextWiser: Text Featurization Library
When you need to turn raw text documents into numerical data for analysis or machine learning, this tool helps you choose and apply various text "featurization" methods. It takes your text (like customer reviews, news articles, or reports) and converts it into structured numerical representations. Data scientists, machine learning engineers, and NLP practitioners use this to prepare text for tasks like classification, clustering, or search.
Available on PyPI.
Use this if you need a flexible way to transform unstructured text into numerical features using a wide array of methods, including advanced pretrained models, and want to optimize these features for your specific analytical tasks.
Not ideal if you're looking for an off-the-shelf solution for a specific natural language processing task (e.g., sentiment analysis) without needing to customize the underlying feature extraction.
Stars
58
Forks
9
Language
Python
License
Apache-2.0
Category
Last pushed
Feb 05, 2026
Commits (30d)
0
Dependencies
8
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/fidelity/textwiser"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
RandolphVI/Multi-Label-Text-Classification
About Muti-Label Text Classification Based on Neural Network.
ThilinaRajapakse/pytorch-transformers-classification
Based on the Pytorch-Transformers library by HuggingFace. To be used as a starting point for...
ntumlgroup/LibMultiLabel
A library for multi-class and multi-label classification
xuyige/BERT4doc-Classification
Code and source for paper ``How to Fine-Tune BERT for Text Classification?``
allenai/scibert
A BERT model for scientific text.