ctakes and ctakesspark
ctakesspark is a complementary extension to ctakes: it attempts to add distributed processing so that cTAKES NLP pipelines can scale across Apache Spark clusters rather than run on a single machine.
About ctakes
apache/ctakes
Apache cTAKES is a Natural Language Processing (NLP) platform for clinical text.
It helps healthcare professionals automatically extract critical medical information from unstructured clinical text such as patient notes, discharge summaries, and radiology reports. It takes these natural-language documents as input and identifies concepts such as symptoms, diagnoses, medications, and procedures, along with their attributes and standard medical codes. It is aimed at medical researchers, clinicians, and health data analysts who need to process large volumes of clinical narratives.
About ctakesspark
yugagarin/ctakesspark
An attempt to integrate Apache cTAKES with Apache Spark.
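As a rough illustration of what such an integration usually involves, the sketch below shows the per-partition processing pattern in plain Python, with no Spark or cTAKES dependency. It is not taken from the ctakesspark repository. The names `build_annotator`, `process_partition`, and `split_into_partitions` are hypothetical, and the toy vocabulary stands in for a real cTAKES pipeline. The point it shows is that an expensive pipeline is built once per partition and reused for every note in it.

```python
from typing import Callable, Iterable, Iterator, List


def build_annotator() -> Callable[[str], List[str]]:
    """Hypothetical stand-in for constructing a cTAKES pipeline.

    A real pipeline is costly to initialize, which is why it is built
    once per partition rather than once per document.
    """
    vocabulary = {"fever", "cough", "aspirin"}  # toy dictionary, not cTAKES data

    def annotate(note: str) -> List[str]:
        # Lowercase each token and strip trailing punctuation before lookup.
        tokens = [t.strip(".,;").lower() for t in note.split()]
        return sorted({t for t in tokens if t in vocabulary})

    return annotate


def process_partition(notes: Iterable[str]) -> Iterator[List[str]]:
    """Annotate every note in one partition with a single shared annotator."""
    annotate = build_annotator()  # built once, reused for the whole partition
    for note in notes:
        yield annotate(note)


def split_into_partitions(items: List[str], n: int) -> List[List[str]]:
    """Round-robin split that imitates how a cluster distributes records."""
    return [items[i::n] for i in range(n)]


notes = [
    "Patient reports fever and cough.",
    "Started aspirin.",
    "No complaints.",
]
partitions = split_into_partitions(notes, 2)
results = [list(process_partition(p)) for p in partitions]
```

On an actual cluster, a function shaped like `process_partition` could be passed to PySpark's `RDD.mapPartitions`. Whether ctakesspark takes this approach is an assumption, not something its description states.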