Riccorl/ipa

NLP Preprocessing Pipeline Wrappers

/ 100

Experimental

This tool helps data scientists and NLP practitioners prepare text for analysis. It takes raw text as input and returns structured tokens annotated with information such as part-of-speech tags and lemmas, so the text is ready for downstream machine learning or linguistic tasks. It is aimed at people who work with textual data and need to standardize it efficiently.

No commits in the last 6 months.

Use this if you need to quickly and consistently preprocess text data, extracting linguistic features like words, their grammatical roles, or base forms, and want the flexibility to easily switch between different underlying NLP libraries.
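
The backend-switching idea described above can be illustrated with a minimal sketch. This is not the API of Riccorl/ipa; the names `Token`, `Preprocessor`, `register`, and `whitespace_backend` are hypothetical and exist only for illustration. The toy backend uses the standard library alone, where a real setup would presumably wrap an actual NLP library behind the same interface.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional


@dataclass(frozen=True)
class Token:
    """One token with optional linguistic annotations (hypothetical structure)."""
    text: str
    pos: Optional[str] = None    # part-of-speech tag, if the backend provides one
    lemma: Optional[str] = None  # base form, if the backend provides one


# A backend is any callable that turns raw text into a list of tokens.
Backend = Callable[[str], List[Token]]


def whitespace_backend(text: str) -> List[Token]:
    # Toy backend: splits on whitespace and uses the lowercased word as a
    # stand-in lemma. It produces no POS tags.
    return [Token(text=word, lemma=word.lower()) for word in text.split()]


class Preprocessor:
    """Dispatches to a named backend so callers can switch libraries freely."""

    def __init__(self) -> None:
        self._backends: Dict[str, Backend] = {}

    def register(self, name: str, backend: Backend) -> None:
        self._backends[name] = backend

    def __call__(self, text: str, backend: str) -> List[Token]:
        try:
            run = self._backends[backend]
        except KeyError:
            raise ValueError(f"unknown backend: {backend!r}") from None
        return run(text)


preprocessor = Preprocessor()
preprocessor.register("whitespace", whitespace_backend)
tokens = preprocessor("Cats were sleeping", backend="whitespace")
```

Because every backend returns the same `Token` structure, downstream code does not need to change when a different library is registered under another name.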

Not ideal if you're looking for a complete end-to-end natural language processing solution that includes advanced model training or complex semantic analysis out of the box.

text-preparation linguistic-analysis data-cleaning NLP-workflow computational-linguistics

No License, Stale 6m, No Package, No Dependents

Maintenance 0 / 25

Adoption 5 / 25

Maturity 8 / 25

Community 0 / 25


Stars

Forks

—

Language

Python

License

—

Higher-rated alternatives

sloria/TextBlob

Simple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun phrase...

chrismattmann/tika-python

Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called...

cltk/cltk

The Classical Language Toolkit

allenai/scispacy

A full spaCy pipeline and models for scientific/biomedical documents.

wi2trier/cbrkit

Customizable Case-Based Reasoning (CBR) toolkit for Python with a built-in API and CLI.
