Riccorl/ipa
NLP Preprocessing Pipeline Wrappers
This tool helps data scientists and NLP practitioners prepare text data for analysis. It takes raw text as input and outputs structured tokens with information like part-of-speech tags and lemmas, making text ready for further machine learning or linguistic tasks. It's designed for those who work with textual data and need to standardize it efficiently.
No commits in the last 6 months.
Use this if you need to quickly and consistently preprocess text data, extracting linguistic features like words, their grammatical roles, or base forms, and want the flexibility to easily switch between different underlying NLP libraries.
Not ideal if you're looking for a complete end-to-end natural language processing solution that includes advanced model training or complex semantic analysis out of the box.
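The idea of a wrapper that returns structured tokens and lets you swap the underlying library can be illustrated with a minimal sketch. This is not the project's actual API: the names `Token`, `Preprocessor`, `whitespace_backend`, and `punctuation_aware_backend` are hypothetical, and the two toy backends stand in for real NLP libraries, which would supply genuine lemmas and part-of-speech tags.

```python
import re
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass(frozen=True)
class Token:
    """One structured token (hypothetical shape, not the project's own class)."""
    text: str
    lemma: str
    pos: str


# A backend is any callable that maps raw text to a list of Token objects.
Backend = Callable[[str], List[Token]]


def whitespace_backend(text: str) -> List[Token]:
    """Toy backend: split on whitespace; lowercase stands in for a lemma."""
    return [Token(text=w, lemma=w.lower(), pos="X") for w in text.split()]


def punctuation_aware_backend(text: str) -> List[Token]:
    """Toy backend: also splits off punctuation and tags it as PUNCT."""
    tokens = []
    for piece in re.findall(r"\w+|[^\w\s]", text):
        pos = "X" if re.match(r"\w", piece) else "PUNCT"
        tokens.append(Token(text=piece, lemma=piece.lower(), pos=pos))
    return tokens


# Registry of available backends; a real wrapper would register
# adapters around actual NLP libraries here instead of toy functions.
BACKENDS: Dict[str, Backend] = {
    "whitespace": whitespace_backend,
    "punct": punctuation_aware_backend,
}


class Preprocessor:
    """Uniform front end: the caller's code stays the same across backends."""

    def __init__(self, backend: str = "whitespace") -> None:
        if backend not in BACKENDS:
            raise ValueError(f"unknown backend: {backend!r}")
        self._backend = BACKENDS[backend]

    def __call__(self, text: str) -> List[Token]:
        return self._backend(text)


# Switching libraries is a one-word change at construction time.
simple = Preprocessor("whitespace")("Hello, world!")
detailed = Preprocessor("punct")("Hello, world!")
```

Because both backends return the same `Token` shape, downstream code that reads `token.lemma` or `token.pos` does not need to change when the backend does.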
Stars: 11
Forks: —
Language: Python
License: —
Category:
Last pushed: May 12, 2023
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/Riccorl/ipa"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
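The same request can be made from Python with only the standard library. This is a rough sketch under two assumptions that the page does not confirm: that the URL follows the pattern `/quality/<category>/<owner>/<repo>` (inferred from the single example above) and that the endpoint returns JSON. The helper names `quality_url` and `fetch_quality` are illustrative.

```python
import json
import urllib.request

BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, repo: str) -> str:
    """Build the endpoint URL; the path pattern is inferred from one example."""
    return f"{BASE_URL}/{category}/{repo}"


def fetch_quality(category: str, repo: str, timeout: float = 10.0):
    """Fetch and decode the response, assuming (unverified) a JSON body."""
    with urllib.request.urlopen(quality_url(category, repo), timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

Calling `fetch_quality("nlp", "Riccorl/ipa")` would issue the same GET request as the curl command and counts against the daily request limit.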
Higher-rated alternatives
sloria/TextBlob: Simple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun phrase...
chrismattmann/tika-python: Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called...
cltk/cltk: The Classical Language Toolkit
allenai/scispacy: A full spaCy pipeline and models for scientific/biomedical documents.
wi2trier/cbrkit: Customizable Case-Based Reasoning (CBR) toolkit for Python with a built-in API and CLI.