dlite-tools/NLPiper
NLPiper is a package that agglomerates different NLP tools and applies their transformations in the target document.
This helps data scientists, researchers, or anyone working with text prepare their data for analysis. It takes raw text documents, like articles or social media posts, and cleans, standardizes, and breaks them down into individual words or phrases. The output is a structured document ready for tasks like sentiment analysis, topic modeling, or machine translation.
No commits in the last 6 months. Available on PyPI.
Use this if you need to systematically clean, normalize, and tokenize text data from various sources as a first step in your natural language processing workflow.
Not ideal if you're looking for an out-of-the-box solution that performs advanced NLP tasks like sentiment analysis or summarization without additional custom development.
Stars
19
Forks
1
Language
Python
License
MIT
Category
Last pushed
Aug 25, 2023
Commits (30d)
0
Dependencies
1
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/dlite-tools/NLPiper"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
chrismattmann/tika-python
Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called...
sloria/TextBlob
Simple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun phrase...
cltk/cltk
The Classical Language Toolkit
allenai/scispacy
A full spaCy pipeline and models for scientific/biomedical documents.
wi2trier/cbrkit
Customizable Case-Based Reasoning (CBR) toolkit for Python with a built-in API and CLI.