AI45Lab/DeepScan

Diagnostic Framework for LLMs and MLLMs

28 / 100
Experimental

This framework helps AI researchers and engineers evaluate Large Language Models (LLMs) and Multimodal LLMs (MLLMs) to understand their behavior and identify potential issues. You input a model (like Qwen, Llama, or Gemma) and a dataset, and it outputs detailed diagnostic reports and analyses of the model's performance and internal workings. It's designed for anyone who needs to rigorously test and improve the reliability and safety of large AI models.

Use this if you need a structured way to diagnose the performance and safety aspects of various LLMs and MLLMs, getting detailed insights beyond simple accuracy metrics.

Not ideal if you're looking for a simple, out-of-the-box solution for basic model evaluation without needing deep diagnostic insights or customizability.

LLM-evaluation AI-safety model-diagnosis AI-benchmarking AI-research
No package · No dependents
Maintenance 10 / 25
Adoption 7 / 25
Maturity 11 / 25
Community 0 / 25

How are scores calculated?

Stars: 34
Forks:
Language: Python
License:
Last pushed: Mar 02, 2026
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/AI45Lab/DeepScan"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
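The curl call above can also be scripted. Here is a minimal Python sketch using only the standard library, assuming the endpoint returns JSON (the response schema is not documented on this page, so the decoded dict is passed through as-is):

```python
import json
from urllib.request import urlopen

# Base path taken from the curl example above.
BASE = "https://pt-edge.onrender.com/api/v1/quality/llm-tools"


def quality_url(owner: str, repo: str) -> str:
    """Build the quality-endpoint URL for a GitHub owner/repo pair."""
    return f"{BASE}/{owner}/{repo}"


def fetch_quality(owner: str, repo: str) -> dict:
    """Fetch and decode the quality report (performs a network request)."""
    with urlopen(quality_url(owner, repo)) as resp:
        return json.load(resp)


if __name__ == "__main__":
    # Prints the URL for this repo; call fetch_quality() to hit the API.
    print(quality_url("AI45Lab", "DeepScan"))
```

With no key this counts against the 100-requests/day limit; the page does not specify how an API key is passed, so that part is left out rather than guessed.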