olaflaitinen/llm-proteomics-hallucination
Systematic evaluation of hallucination risks in Large Language Models (GPT-4, Claude 3, Gemini Pro) for clinical proteomics and mass spectrometry interpretation. Production-ready detection framework with comprehensive benchmarks.
This project helps clinical researchers and medical professionals understand the risks of using large language models (LLMs) to interpret clinical proteomics and mass spectrometry data. It takes LLM responses to specialized queries about proteins and their modifications and outputs a detailed evaluation of their accuracy, highlighting hallucination rates and risk factors. It is aimed at medical researchers, lab directors, and clinicians considering AI for diagnostic support in proteomics.
Use this if you are a clinical proteomics expert concerned about the reliability of AI-generated insights for patient care and need to quantify hallucination risks.
Not ideal if you are looking for an LLM to directly integrate into a clinical workflow without rigorous validation or human oversight, as it demonstrates significant safety concerns.
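The page does not show the repository's actual evaluation API, so as an illustration only, here is a minimal sketch of the kind of hallucination-rate computation such a framework performs: expert-labeled LLM answers in, an aggregate rate out. Every name below (`Judgment`, `hallucination_rate`) is hypothetical, not taken from the project.

```python
# Hypothetical sketch of aggregating expert judgments of LLM answers.
# All class and function names are illustrative; the repository's real
# data model is not documented on this page.
from dataclasses import dataclass


@dataclass
class Judgment:
    query: str               # proteomics question posed to the model
    llm_answer: str          # the model's response
    is_hallucination: bool   # expert label: response contains a fabricated claim


def hallucination_rate(judgments: list[Judgment]) -> float:
    """Fraction of responses flagged as hallucinated (0.0 for empty input)."""
    if not judgments:
        return 0.0
    flagged = sum(j.is_hallucination for j in judgments)
    return flagged / len(judgments)
```

A rate like this, broken down per model and per query category, is the kind of headline number the benchmark reports.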
Stars: 9
Forks: 2
Language: Python
License: MIT
Category:
Last pushed: Nov 11, 2025
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/olaflaitinen/llm-proteomics-hallucination"
Open to everyone: 100 requests/day with no key needed; a free key raises the limit to 1,000/day.
Higher-rated alternatives
Goekdeniz-Guelmez/mlx-lm-lora
Train Large Language Models on MLX.
uber-research/PPLM
Plug and Play Language Model implementation. Allows steering the topic and attributes of GPT-2 models.
VHellendoorn/Code-LMs
Guide to using pre-trained large language models of source code
ssbuild/chatglm_finetuning
ChatGLM-6B fine-tuning and Alpaca fine-tuning.
jarobyte91/pytorch_beam_search
A lightweight implementation of Beam Search for sequence models in PyTorch.