UglyToad/PragmaticSegmenterNet

Port of PragmaticSegmenter for sentence boundary detection

41
/ 100
Emerging

When you have a block of text and need to break it down into individual sentences, this tool helps. It takes raw text in various languages and outputs a clean list of separate sentences, making it useful for anyone working with textual data, such as researchers, content analysts, or linguists.

No commits in the last 6 months.

Use this if you need to accurately split paragraphs or longer text into distinct sentences, especially when dealing with multiple languages or text from different sources like PDFs or HTML.

Not ideal if your primary need is word-level tokenization or if you are working exclusively with highly structured data that doesn't require complex sentence boundary detection.

text-analysis natural-language-processing data-preparation content-management linguistics
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 7 / 25
Maturity 16 / 25
Community 18 / 25

How are scores calculated?

Stars

39

Forks

12

Language

C#

License

Last pushed

Sep 21, 2021

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/UglyToad/PragmaticSegmenterNet"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.