leduckhai/MultiMed

[LREC-COLING 2024 (Oral), Interspeech 2024 (Oral), NAACL 2025, ACL 2025, EMNLP 2025] A Series of Multilingual Multitask Medical Speech Processing

/ 100

Emerging

This project offers tools to understand medical conversations by converting spoken language into text, extracting key information, and summarizing discussions. It takes audio recordings of medical speech, such as doctor-patient dialogues, and outputs detailed transcripts, identified medical terms, and concise summaries. Medical professionals, researchers studying medical communication, and healthcare administrators can use this to streamline documentation and analyze interactions.

373 stars.

Use this if you need to accurately transcribe medical conversations, identify specific medical terms within speech, or generate summaries from spoken medical interactions across multiple languages.

Not ideal if your primary need is for general domain speech recognition or summarization outside of a medical context.

medical-transcription healthcare-documentation clinical-communication medical-natural-language-processing speech-to-text-healthcare

No License No Package No Dependents

Maintenance 6 / 25

Adoption 10 / 25

Maturity 8 / 25

Community 16 / 25

How are scores calculated?

Stars

373

Forks

Language

Python

License

—

Higher-rated alternatives

THU-KEG/OmniEvent

A comprehensive, unified and modular event extraction toolkit.

lil-lab/nlvr

Cornell NLVR and NLVR2 are natural language grounding datasets. Each example shows a visual...

Explore NLP Tools

All categories Trending NLP directory Insights