GanjinZero/CODER
CODER: Knowledge infused cross-lingual medical term embedding for term normalization. [JBI, ACL-BioNLP 2022]
This tool helps medical researchers and clinical informatics specialists standardize medical terms found in diverse text sources, including those in multiple languages. It takes unstructured medical text as input and identifies the correct, consistent medical concepts, even if they are expressed differently. The output is a normalized, concept-aligned representation of these terms, making them consistent for analysis or integration.
No commits in the last 6 months.
Use this if you need to precisely map various ways medical terms are written (e.g., abbreviations, synonyms, or different languages) to a single, standardized concept for better data analysis or system interoperability.
Not ideal if your primary need is general natural language processing outside of specialized medical terminology or if you don't require cross-lingual term normalization.
Stars
80
Forks
6
Language
Python
License
—
Category
Last pushed
Jun 28, 2022
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/embeddings/GanjinZero/CODER"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
mims-harvard/ClinVec
ClinVec: Unified Embeddings of Clinical Codes Enable Knowledge-Grounded AI in Medicine
NYUMedML/DeepEHR
Chronic Disease Prediction Using Medical Notes
mims-harvard/SHEPHERD
SHEPHERD: Few shot learning for phenotype-driven diagnosis of patients with rare genetic diseases
biocentral/biocentral_server
Compute functionality for biocentral.
nomic-ai/contrastors
Train Models Contrastively in Pytorch