loomchild/segment

Program used to split text into segments

40
/ 100
Emerging

This tool helps language professionals, localization managers, and content creators automatically split large blocks of text into smaller, manageable segments, like individual sentences. You provide your text along with a set of segmentation rules (in SRX format), and it outputs the text broken down into discrete segments, one per line or separated by custom markers. It's designed for anyone who needs to prepare text for processes like machine translation, linguistic analysis, or indexing.

No commits in the last 6 months.

Use this if you need a reliable way to automatically segment plain text based on industry-standard SRX rules, especially for preparing content for translation memory systems or linguistic workflows.

Not ideal if you need to preserve original text formatting (like rich text or XML), require highly specialized segmentation not covered by SRX, or are looking for a GUI-based desktop application.

localization translation-memory natural-language-processing text-preparation content-management
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 7 / 25
Maturity 16 / 25
Community 17 / 25

How are scores calculated?

Stars

28

Forks

10

Language

Java

License

MIT

Last pushed

Oct 27, 2024

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/loomchild/segment"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.