apache/ctakes
Apache cTAKES is a Natural Language Processing (NLP) platform for clinical text.
This platform helps healthcare professionals automatically extract critical medical information from unstructured clinical text, such as patient notes, discharge summaries, or radiology reports. It takes these natural language documents as input and identifies concepts like symptoms, diagnoses, medications, and procedures, along with their attributes and standard medical codes. Medical researchers, clinicians, and health data analysts who need to process large volumes of clinical narratives would use this.
123 stars.
Use this if you need to systematically identify and extract medical concepts and their relationships from free-text clinical documents for research, patient care analysis, or population health studies.
Not ideal if your primary goal is general-purpose natural language processing outside of the clinical or biomedical domain.
Stars
123
Forks
23
Language
Java
License
Apache-2.0
Category
Last pushed
Mar 13, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/apache/ctakes"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Compare
Related tools
Georgetown-IR-Lab/QuickUMLS
System for Medical Concept Extraction and Linking
CogStack/MedCAT
Medical Concept Annotation Tool
medkit-lib/medkit
Toolkit for a learning health system
CogStack/MedCATtrainer
A simple interface to inspect, improve and add concepts to biomedical NER+L -> MedCAT.
OHNLP/MedTagger
MedTagger is a light weight clinical NLP system built upon Apache UIMA.