GanjinZero/CODER

CODER: Knowledge infused cross-lingual medical term embedding for term normalization. [JBI, ACL-BioNLP 2022]

/ 100

Experimental

This tool helps medical researchers and clinical informatics specialists standardize medical terms found in diverse text sources, including those in multiple languages. It takes unstructured medical text as input and identifies the correct, consistent medical concepts, even if they are expressed differently. The output is a normalized, concept-aligned representation of these terms, making them consistent for analysis or integration.

No commits in the last 6 months.

Use this if you need to precisely map various ways medical terms are written (e.g., abbreviations, synonyms, or different languages) to a single, standardized concept for better data analysis or system interoperability.

Not ideal if your primary need is general natural language processing outside of specialized medical terminology or if you don't require cross-lingual term normalization.

medical-nlp clinical-informatics biomedical-data-standardization health-data-management medical-terminology-mapping

No License Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 9 / 25

Maturity 8 / 25

Community 10 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

—

Higher-rated alternatives

mims-harvard/ClinVec

ClinVec: Unified Embeddings of Clinical Codes Enable Knowledge-Grounded AI in Medicine

NYUMedML/DeepEHR

Chronic Disease Prediction Using Medical Notes

mims-harvard/SHEPHERD

SHEPHERD: Few shot learning for phenotype-driven diagnosis of patients with rare genetic diseases

biocentral/biocentral_server

Compute functionality for biocentral.

nomic-ai/contrastors

Train Models Contrastively in Pytorch

Explore Embedding Tools

All categories Trending Embeddings directory Insights