scurkovic/cutters

A rule-based sentence segmentation library.

21 / 100 (Experimental)

Cutters splits raw text into individual sentences, including text with tricky punctuation such as abbreviations or quoted speech. You pass in a block of text in Croatian or English, and it returns a list of separated sentences. This is useful for anyone working with text data, such as researchers, linguists, or data analysts preparing text for further processing.
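To make the idea of rule-based segmentation concrete, here is a minimal sketch in plain Rust. It is not the cutters API or its implementation: the function name `split_sentences` and the abbreviation list are hypothetical, chosen only to show how a rule such as "a period after a known abbreviation does not end a sentence" can work.

```rust
/// Hypothetical abbreviation list, for illustration only.
/// A real rule set would be far larger and language-specific.
const ABBREVIATIONS: [&str; 5] = ["Mr.", "Mrs.", "Dr.", "e.g.", "i.e."];

/// Splits `text` into sentences using two simple rules:
/// 1. A word ending in '.', '!' or '?' closes the current sentence.
/// 2. Unless that word is a known abbreviation.
fn split_sentences(text: &str) -> Vec<String> {
    let mut sentences = Vec::new();
    let mut current = String::new();

    for word in text.split_whitespace() {
        if !current.is_empty() {
            current.push(' ');
        }
        current.push_str(word);

        let has_terminator = matches!(word.chars().last(), Some('.' | '!' | '?'));
        let is_abbreviation = ABBREVIATIONS.iter().any(|abbr| *abbr == word);

        if has_terminator && !is_abbreviation {
            // Move the finished sentence out and start a fresh buffer.
            sentences.push(std::mem::take(&mut current));
        }
    }

    // Keep any trailing text that lacks a terminator.
    if !current.is_empty() {
        sentences.push(current);
    }
    sentences
}

fn main() {
    let text = "Dr. Smith arrived late. He apologized, e.g. with flowers! Was that enough?";
    for sentence in split_sentences(text) {
        println!("{sentence}");
    }
}
```

This toy version normalizes whitespace and ignores quoted speech entirely, which is exactly the kind of case a dedicated library is meant to handle.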

No commits in the last 6 months.

Use this if you need to accurately split large volumes of text into individual sentences for analysis, translation, or other natural language processing tasks.

Not ideal if you need to process text in languages other than Croatian or English, or if you require highly specialized segmentation rules beyond standard grammatical structures.

text-analysis linguistics data-preparation natural-language-processing content-management
Stale 6m, No Package, No Dependents
Maintenance 0 / 25
Adoption 5 / 25
Maturity 16 / 25
Community 0 / 25


Stars: 14
Forks:
Language: Rust
License: MIT
Last pushed: Jul 17, 2023
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/scurkovic/cutters"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.