OSU-NLP-Group/AttrScore

Code, datasets, and models for the paper "Automatic Evaluation of Attribution by Large Language Models"

Quality score: 29 / 100 (Experimental)

This project evaluates how well an answer from a large language model (LLM) is supported by a given source text. You provide a claim (a query plus the LLM's answer) and a reference document, and it classifies the claim as Attributable, Extrapolatory, or Contradictory. It is useful for anyone working with LLMs who needs to verify model outputs against source material.
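
Below is a minimal Python sketch of how this kind of evaluation is typically run with a fine-tuned checkpoint. The model name and prompt wording here are illustrative assumptions, not the repo's exact values; check the repository's README for the released checkpoints and prompt templates.

    from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

    # Checkpoint name is a guess for illustration; the repo lists the real ones.
    MODEL = "osunlp/attrscore-flan-t5-large"
    tokenizer = AutoTokenizer.from_pretrained(MODEL)
    model = AutoModelForSeq2SeqLM.from_pretrained(MODEL)

    def evaluate_attribution(query: str, answer: str, reference: str) -> str:
        """Classify a (query + answer) claim against a reference document."""
        # The prompt approximates the paper's setup: present the claim and the
        # reference, then ask for one of the three labels.
        prompt = (
            "Verify whether the reference can support the claim. Answer with "
            "Attributable, Extrapolatory, or Contradictory.\n"
            f"Claim: {query} {answer}\n"
            f"Reference: {reference}"
        )
        inputs = tokenizer(prompt, return_tensors="pt", truncation=True)
        output_ids = model.generate(**inputs, max_new_tokens=10)
        return tokenizer.decode(output_ids[0], skip_special_tokens=True)

    label = evaluate_attribution(
        "Who wrote Hamlet?",
        "Hamlet was written by William Shakespeare.",
        "Hamlet is a tragedy written by William Shakespeare around 1600.",
    )
    print(label)  # expected label: Attributable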

No commits in the last 6 months.

Use this if you need a systematic, automated check of whether LLM-generated answers are actually supported by their source material.

Not ideal if you are looking for a tool to generate text or improve the fluency of an LLM's output, as this focuses on evaluation, not generation.

Tags: LLM-evaluation, fact-checking, content-verification, AI-trustworthiness, information-retrieval
Status: Stale (6 months) · No Package · No Dependents
Maintenance: 0 / 25
Adoption: 8 / 25
Maturity: 16 / 25
Community: 5 / 25


Stars: 56
Forks: 2
Language: Python
License: MIT
Last pushed: Jul 03, 2023
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/OSU-NLP-Group/AttrScore"

Open to everyone: 100 requests/day with no key needed. Get a free key for 1,000/day.
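
For scripted use, the same endpoint can be called from Python. This is a quick sketch; the JSON fields are not documented here, so inspect the payload rather than assuming field names:

    import requests

    url = (
        "https://pt-edge.onrender.com/api/v1/quality/llm-tools/"
        "OSU-NLP-Group/AttrScore"
    )
    resp = requests.get(url, timeout=10)
    resp.raise_for_status()  # surface HTTP errors (e.g. rate limiting)
    data = resp.json()
    print(data)  # print the full payload to see which fields are available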