eric-ai-lab/MSSBench

[ICLR 2025] Official codebase for the ICLR 2025 paper "Multimodal Situational Safety"

/ 100

Emerging

This project helps evaluate multimodal AI models to determine if they can identify and avoid unsafe situations based on images and instructions. It takes in multimodal AI models, a dataset of images and associated safe/unsafe descriptions or instructions, and outputs a safety score or evaluation for the models. It is designed for AI safety researchers and developers working on responsible AI.

No commits in the last 6 months.

Use this if you are developing or evaluating AI systems that interact with users through images and text and need to ensure they operate safely.

Not ideal if you are looking for a pre-built safety filter or an AI model to directly apply to a production system.

AI safety responsible AI multimodal AI AI evaluation human-AI interaction

Stale 6m No Package No Dependents

Maintenance 2 / 25

Adoption 7 / 25

Maturity 16 / 25

Community 6 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

MIT

Higher-rated alternatives

google-deepmind/concordia

A library for generative social simulation

Mai-xiyu/Minecraft_AI

AI Play Minecraft

mikelma/craftium

A framework for creating rich, 3D, Minecraft-like single and multi-agent environments for AI...

cocacola-lab/MineLand

Simulating Large-Scale Multi-Agent Interactions with Limited Multimodal Senses and Physical Needs

rezaho/MARSYS

Multi-Agent Reasoning Systems

Explore AI Agents

All categories Trending AI Agent directory Insights