Ruiyang-061X/Uncertainty-o
✨ Official code for our paper: "Uncertainty-o: One Model-agnostic Framework for Unveiling Epistemic Uncertainty in Large Multimodal Models".
This project helps researchers and developers working with Large Multimodal Models (LMMs) quantify how confident these models are in their responses. Given a multimodal prompt (e.g., an image paired with a text query) and an LMM, it outputs a quantifiable measure of the model's uncertainty about its answer. This is useful when evaluating LMM performance, especially in scenarios where accuracy and reliability are critical.
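The listing above does not spell out the algorithm, but the prompt-in, uncertainty-out workflow it describes can be pictured with a minimal sketch of the common sample-and-disagree recipe: query the LMM several times on perturbed variants of the prompt and score uncertainty as the entropy of the answers. This is an illustrative assumption, not the repo's actual API; query_lmm and perturb_prompt are hypothetical callables you would supply.

from collections import Counter
import math

def answer_entropy(answers):
    """Shannon entropy (nats) of the empirical answer distribution."""
    counts = Counter(answers)
    total = len(answers)
    return -sum((c / total) * math.log(c / total) for c in counts.values())

def estimate_uncertainty(query_lmm, image, text, perturb_prompt, n_samples=10):
    """Hypothetical sketch: sample answers to perturbed variants of the
    multimodal prompt (image, text) and score uncertainty as answer entropy."""
    answers = []
    for _ in range(n_samples):
        img_v, txt_v = perturb_prompt(image, text)  # e.g., crop / paraphrase
        answers.append(query_lmm(img_v, txt_v).strip().lower())
    return answer_entropy(answers)

High entropy means the model's answers scatter under small prompt changes, which is exactly the unreliable-output signal the "use this if" note below refers to; zero entropy means it answered consistently every time.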
No commits in the last 6 months.
Use this if you need to detect and understand 'hallucinations' or unreliable outputs from Large Multimodal Models when processing mixed image and text inputs.
Not ideal if you are working with purely text-based models or if you need a solution that directly improves LMM accuracy rather than just measuring its uncertainty.
Stars: 18
Forks: 3
Language: Python
License: —
Category: —
Last pushed: Mar 13, 2025
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/Ruiyang-061X/Uncertainty-o"
Open to everyone: 100 requests/day with no key needed; get a free key to raise the limit to 1,000/day.
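For programmatic access outside the shell, the same documented endpoint can be hit from Python. A minimal standard-library sketch follows; the response's JSON schema is not documented here, so it is simply pretty-printed.

import json
import urllib.request

# Endpoint as given in the curl example above.
URL = "https://pt-edge.onrender.com/api/v1/quality/llm-tools/Ruiyang-061X/Uncertainty-o"

with urllib.request.urlopen(URL) as resp:
    data = json.load(resp)  # parse the JSON body
print(json.dumps(data, indent=2))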
Higher-rated alternatives
open-thought/reasoning-gym
[NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards
Hmbown/Hegelion
Dialectical reasoning architecture for LLMs (Thesis → Antithesis → Synthesis)
LLM360/Reasoning360
A repo for open research on building large reasoning models
bowang-lab/BioReason
BioReason: Incentivizing Multimodal Biological Reasoning within a DNA-LLM Model | NeurIPS '25
TsinghuaC3I/Awesome-RL-for-LRMs
A Survey of Reinforcement Learning for Large Reasoning Models