cvs-health/uqlm
UQLM (Uncertainty Quantification for Language Models) is a Python package for UQ-based LLM hallucination detection.
This tool helps users of large language models (LLMs) detect when a model may be generating incorrect or fabricated information, known as "hallucinations." You provide prompts to an LLM, and the package analyzes the responses and produces a confidence score indicating how likely each answer is to be accurate. This is useful for anyone relying on LLM outputs for critical tasks, such as content creators, researchers, or customer service managers.
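A rough sketch of that workflow is shown below, following the black-box scoring pattern described in the repository's README. The class name BlackBoxUQ, the "semantic_negentropy" scorer, and the generate_and_score method are taken from that documentation but should be checked against the installed release; the ChatOpenAI client is only an example LLM backend, and any LangChain-compatible chat model should work.

# Minimal sketch of hallucination scoring with UQLM (names per the repo README;
# verify against the current version). Requires: pip install uqlm langchain-openai
import asyncio

from langchain_openai import ChatOpenAI  # example LLM client; swap in your own
from uqlm import BlackBoxUQ


async def main():
    llm = ChatOpenAI(model="gpt-4o-mini", temperature=1.0)

    # Black-box UQ: sample several responses per prompt and score their consistency.
    scorer = BlackBoxUQ(llm=llm, scorers=["semantic_negentropy"], use_best=True)

    prompts = ["What year was the Eiffel Tower completed?"]
    results = await scorer.generate_and_score(prompts=prompts, num_responses=5)

    # Each row pairs a generated answer with a confidence score in [0, 1];
    # low scores flag responses that are more likely to be hallucinated.
    print(results.to_df())


if __name__ == "__main__":
    asyncio.run(main())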
1,121 stars. Actively maintained with 33 commits in the last 30 days. Available on PyPI.
Use this if you need to quickly assess the trustworthiness of responses generated by large language models and want to reduce the risk of acting on false information.
Not ideal if you primarily need to improve the underlying accuracy of your LLM rather than just detecting potential errors in its outputs.
Stars: 1,121
Forks: 116
Language: Python
License: Apache-2.0
Category:
Last pushed: Mar 12, 2026
Commits (30d): 33
Dependencies: 14
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/cvs-health/uqlm"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
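For programmatic use, the same request can be made from Python, as sketched below. Only the endpoint URL above is taken from this page; the response schema is not documented here, so the JSON is printed as-is, and the header or parameter used to pass an API key for the 1,000-request tier is not shown here and is omitted.

import requests

# Endpoint shown above (no key needed for the 100-requests/day tier).
url = "https://pt-edge.onrender.com/api/v1/quality/transformers/cvs-health/uqlm"

resp = requests.get(url, timeout=30)
resp.raise_for_status()

# Response schema is undocumented on this page; inspect the JSON directly.
print(resp.json())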
Related models
PRIME-RL/TTRL
[NeurIPS 2025] TTRL: Test-Time Reinforcement Learning
sapientinc/HRM
Hierarchical Reasoning Model Official Release
tigerchen52/query_level_uncertainty
query-level uncertainty in LLMs
reasoning-survey/Awesome-Reasoning-Foundation-Models
✨✨Latest Papers and Benchmarks in Reasoning with Foundation Models
HKUDS/LightReasoner
"LightReasoner: Can Small Language Models Teach Large Language Models Reasoning?"