IS2Lab/S-Eval

S-Eval: Towards Automated and Comprehensive Safety Evaluation for Large Language Models

Overall score: 44 / 100 (Emerging)

This project provides a comprehensive set of evaluation prompts for testing the safety of Large Language Models (LLMs) across a range of harmful-output categories. It takes LLM responses to these prompts as input and helps identify whether the model generates content related to crime, hate speech, privacy violations, or other unsafe categories. It is aimed primarily at AI safety researchers and developers who build or deploy LLMs and need to ensure their models are not generating problematic content.
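As a rough sketch of that workflow (not S-Eval's actual interface), the Python snippet below loops a file of benchmark prompts through a model under test and stores the responses for later safety labeling. The file names and the query_model stub are hypothetical placeholders.

import json

def query_model(prompt: str) -> str:
    # Replace this stub with a call to the LLM being evaluated,
    # e.g. via its SDK or HTTP API.
    return "[model response placeholder]"

# Hypothetical layout: one benchmark prompt per JSON line,
# e.g. {"id": ..., "category": ..., "prompt": ...}.
with open("s_eval_prompts.jsonl", encoding="utf-8") as f:
    prompts = [json.loads(line) for line in f]

with open("responses.jsonl", "w", encoding="utf-8") as out:
    for item in prompts:
        # Keep the prompt, its risk category, and the model's answer together
        # so a reviewer or an automated judge can label the response later.
        record = {**item, "response": query_model(item["prompt"])}
        out.write(json.dumps(record, ensure_ascii=False) + "\n")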


Use this if you need a structured, multi-dimensional benchmark to systematically assess the safety performance of your Large Language Models.

Not ideal if you are a casual user looking for a simple, single-metric safety check for a pre-existing LLM.

AI-safety LLM-evaluation harmful-content-detection model-testing responsible-AI
No Package · No Dependents
Maintenance 10 / 25
Adoption 9 / 25
Maturity 16 / 25
Community 9 / 25
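The four sub-scores (each out of 25) account for the overall rating shown above: 10 + 9 + 16 + 9 = 44 / 100.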


Stars: 111
Forks: 6
Language:
License:
Last pushed: Feb 13, 2026
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/IS2Lab/S-Eval"

Open to everyone: 100 requests/day with no key needed. Get a free key for 1,000 requests/day.
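For programmatic access from Python, a minimal sketch like the following should be enough; the structure of the returned JSON is not documented here, so treat the payload as an opaque dictionary until you have inspected it.

import json
import urllib.request

URL = "https://pt-edge.onrender.com/api/v1/quality/llm-tools/IS2Lab/S-Eval"

# Same request as the curl command above; prints the raw JSON payload.
with urllib.request.urlopen(URL, timeout=10) as resp:
    data = json.load(resp)

print(json.dumps(data, indent=2))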