leduckhai/MultiMed

[LREC-COLING 2024 (Oral), Interspeech 2024 (Oral), NAACL 2025, ACL 2025, EMNLP 2025] A Series of Multilingual Multitask Medical Speech Processing

40
/ 100
Emerging

This project offers tools to understand medical conversations by converting spoken language into text, extracting key information, and summarizing discussions. It takes audio recordings of medical speech, such as doctor-patient dialogues, and outputs detailed transcripts, identified medical terms, and concise summaries. Medical professionals, researchers studying medical communication, and healthcare administrators can use this to streamline documentation and analyze interactions.

373 stars.

Use this if you need to accurately transcribe medical conversations, identify specific medical terms within speech, or generate summaries from spoken medical interactions across multiple languages.

Not ideal if your primary need is for general domain speech recognition or summarization outside of a medical context.

medical-transcription healthcare-documentation clinical-communication medical-natural-language-processing speech-to-text-healthcare
No License No Package No Dependents
Maintenance 6 / 25
Adoption 10 / 25
Maturity 8 / 25
Community 16 / 25

How are scores calculated?

Stars

373

Forks

36

Language

Python

License

Last pushed

Dec 31, 2025

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/leduckhai/MultiMed"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.