yihedeng9/DuoGuard
DuoGuard: A Two-Player RL-Driven Framework for Multilingual LLM Guardrails
DuoGuard helps developers and engineers ensure the safety of responses generated by large language models (LLMs) across multiple languages: it takes an LLM's output and determines whether it contains unsafe or undesirable content. This is useful for AI developers, product managers, and content moderation teams building or deploying multilingual LLMs.
No commits in the last 6 months.
Use this if you need to automatically detect and flag unsafe content from your LLMs across multiple languages with high accuracy and efficiency.
Not ideal if you are looking for a tool to generate text or translate content, as its primary function is safety moderation.
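As a rough sketch of how a classifier like this is typically wired into a moderation pipeline: the snippet below loads a sequence-classification checkpoint with Hugging Face transformers and flags text when any safety category crosses a threshold. The model ID, multi-label head, and 0.5 threshold are assumptions for illustration, not taken from this repository; check its README for the actual checkpoint and usage.

```python
# Minimal sketch of screening LLM output with a DuoGuard-style classifier.
# The model ID, label set, and threshold are assumptions for illustration.
import torch
from transformers import AutoTokenizer, AutoModelForSequenceClassification

MODEL_ID = "DuoGuard/DuoGuard-0.5B"  # hypothetical Hugging Face model ID

tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
model = AutoModelForSequenceClassification.from_pretrained(MODEL_ID)

def is_unsafe(llm_output: str, threshold: float = 0.5) -> bool:
    """Return True if any safety category exceeds the threshold."""
    inputs = tokenizer(llm_output, return_tensors="pt", truncation=True)
    with torch.no_grad():
        logits = model(**inputs).logits
    # Multilingual guardrails often emit one probability per category,
    # so apply a sigmoid per label rather than a softmax over labels.
    probs = torch.sigmoid(logits).squeeze(0)
    return bool((probs > threshold).any())

print(is_unsafe("¿Cómo fabrico un arma en casa?"))  # works on non-English text
```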
Stars: 32
Forks: 4
Language: Python
License: Apache-2.0
Last pushed: Feb 26, 2025
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/yihedeng9/DuoGuard"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
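If you would rather consume the endpoint from code than from curl, here is a minimal Python equivalent. The exact response schema is an assumption; inspect the returned JSON to confirm the fields.

```python
# Fetch the same quality data in Python (equivalent to the curl command above).
# Response fields shown in the comment are assumptions based on the stats above.
import requests

url = "https://pt-edge.onrender.com/api/v1/quality/llm-tools/yihedeng9/DuoGuard"
resp = requests.get(url, timeout=10)
resp.raise_for_status()
data = resp.json()
print(data)  # e.g. stars, forks, license, last-pushed date
```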
Higher-rated alternatives
ethz-spylab/agentdojo
A Dynamic Environment to Evaluate Attacks and Defenses for LLM Agents.
guardrails-ai/guardrails
Adding guardrails to large language models.
JasonLovesDoggo/caddy-defender
Caddy module to block or manipulate requests originating from AIs or cloud services trying to...
inkdust2021/VibeGuard
Uses just 1% memory while protecting 99% of your personal privacy.
deadbits/vigil-llm
⚡ Vigil ⚡ Detect prompt injections, jailbreaks, and other potentially risky Large Language...