rlayers/pawpaw

Text Processing & Segmentation Framework

46
/ 100
Emerging

Pawpaw helps you transform unstructured text, like legal documents or articles, into an organized, searchable tree-like structure. It takes raw text and applies rules to identify and segment parts like paragraphs, sentences, or specific terms, then outputs a hierarchical data structure. This is designed for analysts, researchers, or anyone who needs to extract and query information from large volumes of text.

No commits in the last 6 months. Available on PyPI.

Use this if you need to precisely segment unstructured text into meaningful, hierarchical components and then efficiently search or extract data from that structure.

Not ideal if you just need basic string manipulation or have already structured data that doesn't require deep, rules-based parsing.

text-analysis document-processing information-extraction lexical-parsing knowledge-graph-construction
Stale 6m
Maintenance 2 / 25
Adoption 7 / 25
Maturity 25 / 25
Community 12 / 25

How are scores calculated?

Stars

27

Forks

4

Language

Python

License

MIT

Last pushed

Sep 18, 2025

Commits (30d)

0

Dependencies

1

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/rlayers/pawpaw"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.