neelguha/legal-segmenter
A simple library for segmenting legal texts
This tool helps legal professionals break down dense legal documents into individual sentences. You feed it a block of legal text, and it returns each sentence clearly separated. It's designed for lawyers, paralegals, legal researchers, or anyone who regularly works with legal documents and needs to process them sentence by sentence.
No commits in the last 6 months. Available on PyPI.
Use this if you need to precisely divide complex legal documents, like court opinions, statutes, or contracts, into distinct sentences for easier analysis or processing.
Not ideal if you need semantic analysis of legal texts (for example classification, entity extraction, or summarization); this library only does sentence segmentation.
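To make the segmentation task above concrete, here is a minimal sketch of why legal text needs special handling: citations such as "Brown v. Board" or "347 U.S. 483" contain periods that do not end a sentence. This is an illustrative rule-based approach, not the method or API of legal-segmenter; the function name `split_legal_sentences` and the abbreviation list are assumptions made for the example.

```python
# Illustrative only: NOT the legal-segmenter API or algorithm.
# Hypothetical, deliberately short list of abbreviations that end in a
# period but usually do not end a sentence in legal writing.
LEGAL_ABBREVIATIONS = {"v.", "U.S.", "F.2d", "F.3d", "Cir.", "No.", "Inc.", "Corp."}


def split_legal_sentences(text):
    """Split text into sentences, skipping periods in known abbreviations."""
    tokens = text.split()
    sentences = []
    current = []
    for i, token in enumerate(tokens):
        current.append(token)
        ends_with_terminator = token.endswith((".", "?", "!"))
        is_abbreviation = token in LEGAL_ABBREVIATIONS
        next_token = tokens[i + 1] if i + 1 < len(tokens) else None
        # Treat end of input or a capitalized next token as a sentence start.
        next_starts_sentence = next_token is None or next_token[0].isupper()
        if ends_with_terminator and not is_abbreviation and next_starts_sentence:
            sentences.append(" ".join(current))
            current = []
    if current:
        # Keep trailing text that has no terminating punctuation.
        sentences.append(" ".join(current))
    return sentences


example = (
    "The court relied on Brown v. Board of Education, 347 U.S. 483 (1954). "
    "The judgment was affirmed."
)
result = split_legal_sentences(example)
```

With this input, `result` holds two sentences, and the citation stays intact inside the first one. A naive split on every period would instead break the text at "v." and "U.S.", which is the kind of error a legal-specific segmenter is meant to avoid.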
Stars: 18
Forks: 4
Language: Python
License: —
Category:
Last pushed: Apr 22, 2023
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/neelguha/legal-segmenter"
Open to everyone: 100 requests/day with no key, or 1,000 requests/day with a free key.
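The same request can be made from Python with only the standard library. This is a sketch under two assumptions that the page does not confirm: that the path segments after `quality/` are category, owner, and repository, and that the endpoint returns JSON. The helper names `build_quality_url` and `fetch_quality` are hypothetical. How an API key is passed is not documented here, so the sketch covers keyless access only.

```python
import json
import urllib.request

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category, owner, repo):
    """Build the endpoint URL (assumed layout: category/owner/repo)."""
    return f"{BASE_URL}/{category}/{owner}/{repo}"


def fetch_quality(category, owner, repo, timeout=10.0):
    """Fetch the quality record; assumes the response body is JSON."""
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_quality("nlp", "neelguha", "legal-segmenter")` requests the same URL as the curl command. Keyless access is limited to 100 requests/day, so cache the result if you poll it repeatedly.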
Higher-rated alternatives
EmilStenstrom/conllu
A CoNLL-U parser that takes a CoNLL-U formatted string and turns it into a nested python dictionary.
OpenPecha/Botok
🏷 བོད་ཏོག [pʰøtɔk̚] Tibetan word tokenizer in Python
zaemyung/sentsplit
A flexible sentence segmentation library using CRF model and regex rules
taishi-i/nagisa
A Japanese tokenizer based on recurrent neural networks
natasha/razdel
Rule-based token, sentence segmentation for Russian language