medspacy/sectionizer

A rule-based Python module for splitting documents into sections.

Emerging

This tool helps healthcare professionals and researchers automatically identify and label different sections within clinical documents like patient notes or discharge summaries. It takes unstructured medical text as input and outputs the same text with clearly marked sections, such as 'Chief Complaint', 'History of Present Illness', or 'Medications'. This is useful for anyone working with large volumes of clinical text who needs to quickly extract or organize information by section.
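To make the input and output concrete, here is a minimal sketch of rule-based section splitting using only Python's standard `re` module. It is not the medspacy/sectionizer API: the function name `split_sections`, the `SECTION_RULES` table, and the header patterns are all hypothetical, and the sketch assumes each section header sits at the start of a line and ends with a colon.

```python
import re

# Hypothetical header rules: section label -> regex for its header text.
# Illustrative assumptions only, not the rule set shipped with the library.
SECTION_RULES = {
    "chief_complaint": r"chief complaint",
    "history_of_present_illness": r"history of present illness|hpi",
    "medications": r"medications?",
}

# One combined pattern: a known header at the start of a line, then a colon.
# Each rule becomes a named group so the match reports which rule fired.
HEADER_RE = re.compile(
    r"^[ \t]*(?:"
    + "|".join(f"(?P<{name}>{pat})" for name, pat in SECTION_RULES.items())
    + r")[ \t]*:",
    re.IGNORECASE | re.MULTILINE,
)


def split_sections(text):
    """Return a list of (label, body) pairs in document order."""
    matches = list(HEADER_RE.finditer(text))
    sections = []

    # Text before the first recognized header is kept under the label None.
    first = matches[0].start() if matches else len(text)
    preamble = text[:first].strip()
    if preamble:
        sections.append((None, preamble))

    # Each section body runs from the end of its header to the next header.
    for i, m in enumerate(matches):
        end = matches[i + 1].start() if i + 1 < len(matches) else len(text)
        sections.append((m.lastgroup, text[m.end():end].strip()))
    return sections


if __name__ == "__main__":
    note = (
        "Chief Complaint: chest pain\n"
        "HPI: 3 days of intermittent chest pain.\n"
        "Medications: aspirin 81 mg daily\n"
    )
    for label, body in split_sections(note):
        print(label, "->", body)
```

Keeping the rules in a plain dictionary means a new section type can be added without touching the splitting logic, which is the main appeal of a rule-based approach over a trained model for this task.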

No commits in the last 6 months.

Use this if you need to programmatically identify and label standard sections within unstructured clinical text documents.

Not ideal if you are working with non-clinical documents or if you need to extract specific entities rather than document sections.

clinical-documentation healthcare-research medical-nlp health-informatics text-organization

Stale 6m, No Package, No Dependents

Maintenance 0 / 25

Adoption 5 / 25

Maturity 16 / 25

Community 15 / 25

Language

Jupyter Notebook

License

MIT

Higher-rated alternatives

EmilStenstrom/conllu

A CoNLL-U parser that takes a CoNLL-U formatted string and turns it into a nested python dictionary.

OpenPecha/Botok

🏷 བོད་ཏོག [pʰøtɔk̚] Tibetan word tokenizer in Python

taishi-i/nagisa

A Japanese tokenizer based on recurrent neural networks

zaemyung/sentsplit

A flexible sentence segmentation library using CRF model and regex rules

natasha/razdel

Rule-based token, sentence segmentation for Russian language
