AlexanderVNikitin/kernel-language-entropy

Code for Fine-grained Uncertainty Quantification for LLMs from Semantic Similarities (NeurIPS'24)

Score: 37 / 100 (Emerging)

This tool helps AI researchers and practitioners evaluate how confident a large language model (LLM) is about its generated responses. It takes multiple sampled responses from an LLM and computes a fine-grained uncertainty score from the semantic similarities among them. Researchers building or deploying LLMs can use this to understand and improve model reliability.
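The underlying method, Kernel Language Entropy, treats the pairwise semantic similarities among sampled responses as a kernel and scores uncertainty via that kernel's von Neumann entropy. Below is a minimal illustrative sketch of the idea, assuming a precomputed similarity matrix; the function name and inputs are hypothetical and not the repo's actual API.

import numpy as np

def semantic_uncertainty(similarity: np.ndarray) -> float:
    # Von Neumann entropy of a unit-trace semantic kernel.
    # `similarity`: symmetric positive semidefinite matrix of pairwise
    # semantic similarities among sampled responses (hypothetical input;
    # not the repo's actual API).
    kernel = similarity / np.trace(similarity)  # eigenvalues now sum to 1
    eigvals = np.linalg.eigvalsh(kernel)
    eigvals = eigvals[eigvals > 1e-12]          # drop numerical zeros
    return float(-(eigvals * np.log(eigvals)).sum())

# Toy example: three sampled answers, the first two near-paraphrases.
sim = np.array([[1.0, 0.9, 0.1],
                [0.9, 1.0, 0.1],
                [0.1, 0.1, 1.0]])
print(semantic_uncertainty(sim))  # ~0.76, below the ln(3) ~ 1.10 maximum
                                  # for three fully dissimilar answers

Intuitively, the more the sampled answers agree semantically, the more the kernel's spectrum concentrates on a few eigenvalues and the lower the entropy, i.e., the lower the model's estimated uncertainty.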

No commits in the last 6 months.

Use this if you are developing or evaluating large language models and need to quantify their uncertainty in a more detailed way than traditional methods.

Not ideal if you are looking for a simple, out-of-the-box solution, or if you lack GPU hardware or experience with Python environments.

AI-research LLM-evaluation model-reliability natural-language-processing
Flags: Stale (6m), No Package, No Dependents
Maintenance 0 / 25
Adoption 7 / 25
Maturity 16 / 25
Community 14 / 25
The four subscores, each out of 25, sum to the overall score: 0 + 7 + 16 + 14 = 37.

Stars: 36
Forks: 6
Language: Python
License: BSD-3-Clause-Clear
Last pushed: Dec 17, 2024
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/transformers/AlexanderVNikitin/kernel-language-entropy"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
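The same endpoint can also be queried from Python. A minimal sketch, assuming only that the endpoint returns JSON; the response schema is not documented here, so no specific keys are assumed.

import requests

# Query the quality endpoint shown above. The JSON schema is not
# documented on this page, so the payload is printed for inspection.
url = ("https://pt-edge.onrender.com/api/v1/quality/"
       "transformers/AlexanderVNikitin/kernel-language-entropy")
resp = requests.get(url, timeout=10)
resp.raise_for_status()  # surfaces errors, e.g. hitting the 100 requests/day limit
print(resp.json())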