THU-KEG/WaterBench

[ACL2024-Main] Data and Code for WaterBench: Towards Holistic Evaluation of LLM Watermarks


Experimental

This project helps developers and researchers evaluate the effectiveness of different watermarking techniques for Large Language Models (LLMs). It takes LLM outputs generated under different watermark algorithms and reports metrics such as detection z-scores and GPT-4-based evaluation results. The primary users are researchers and engineers working on LLM security, content provenance, or responsible AI.
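To make the "detection z-score" metric concrete, here is a minimal sketch of how such a score is commonly computed for green-list style watermarks. It assumes a one-proportion z-test over the count of "green" tokens; the function name `detection_z_score` and the default `gamma` are illustrative assumptions, not taken from the WaterBench code.

```python
import math


def detection_z_score(green_count: int, total_tokens: int, gamma: float = 0.25) -> float:
    """Return a one-proportion z-score for a green-list watermark test.

    Assumption: without a watermark, each scored token lands in the green
    list with probability `gamma`. A large positive z-score suggests the
    text contains more green tokens than chance would explain.
    """
    if total_tokens <= 0:
        raise ValueError("total_tokens must be positive")
    if not 0.0 < gamma < 1.0:
        raise ValueError("gamma must be strictly between 0 and 1")
    if not 0 <= green_count <= total_tokens:
        raise ValueError("green_count must be between 0 and total_tokens")

    expected = gamma * total_tokens  # mean green count under the null hypothesis
    std = math.sqrt(total_tokens * gamma * (1.0 - gamma))  # binomial standard deviation
    return (green_count - expected) / std


if __name__ == "__main__":
    # Hypothetical counts: 50 green tokens out of 100 scored tokens.
    print(round(detection_z_score(50, 100), 2))
```

A detector would typically compare this value against a chosen threshold; the threshold and the way green tokens are counted depend on the specific watermark algorithm being evaluated.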

No commits in the last 6 months.

Use this if you are a machine learning researcher or engineer who needs to systematically test and compare how well different watermarks perform on LLMs across various datasets and models.

Not ideal if you are looking for a plug-and-play solution to apply watermarks to your LLMs without needing to dive into evaluation metrics or experiment with different watermark parameters.

LLM evaluation, AI security, Generative AI, Content provenance, Natural Language Processing research

Stale 6m, No Package, No Dependents

Maintenance 0 / 25

Adoption 7 / 25

Maturity 16 / 25

Community 6 / 25


Language: Python

License: MIT

Higher-rated alternatives

THU-BPM/MarkLLM

MarkLLM: An Open-Source Toolkit for LLM Watermarking. (EMNLP 2024 System Demonstration)

git-disl/Vaccine

This is the official code for the paper "Vaccine: Perturbation-aware Alignment for Large...

zjunlp/Deco

[ICLR 2025] MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation

HillZhang1999/ICD

Code & Data for our Paper "Alleviating Hallucinations of Large Language Models through Induced...

voidism/DoLa

Official implementation for the paper "DoLa: Decoding by Contrasting Layers Improves Factuality...
