chaoluond/safetyllama

Fine-tune LLaMA-2-7b-chat to perform safety evaluation of user-bot conversations

Score: 23 / 100 (Experimental)

This project helps ensure AI chatbots give safe and appropriate responses to user prompts. It takes a conversation between a human and a chatbot as input and outputs an evaluation stating whether the chatbot's response adheres to a set of safety guidelines. It is aimed at AI product managers, trust & safety teams, and developers building AI applications.

No commits in the last 6 months.

Use this if you need to automatically monitor and detect unsafe or inappropriate content generated by your AI chatbot before it reaches users.

Not ideal if you are looking for a general-purpose content moderation tool that flags user-generated content directly, rather than chatbot outputs.

Tags: AI Safety · Content Moderation · Chatbot Development · Trust & Safety · AI Ethics
Stale (6m) · No Package · No Dependents
Maintenance 2 / 25
Adoption 5 / 25
Maturity 16 / 25
Community 0 / 25


Stars: 11
Forks:
Language: Python
License: MIT
Last pushed: Jun 02, 2025
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/transformers/chaoluond/safetyllama"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
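For programmatic use, the curl call above can be reproduced with Python's standard library. This is a minimal sketch: the endpoint path (`/api/v1/quality/<registry>/<owner>/<repo>`) is taken from the example above, but the response schema and any API-key header name are not documented here, so the response is returned as raw parsed JSON rather than typed fields.

```python
import json
import urllib.request
from urllib.parse import quote

BASE = "https://pt-edge.onrender.com/api/v1/quality"

def quality_url(registry: str, repo: str) -> str:
    """Build the quality-API URL for a registry and an owner/repo slug."""
    # Percent-encode each part; keep the "/" between owner and repo name.
    return f"{BASE}/{quote(registry)}/{quote(repo, safe='/')}"

def fetch_quality(registry: str, repo: str) -> dict:
    """Fetch the quality report. Anonymous access: 100 requests/day."""
    with urllib.request.urlopen(quality_url(registry, repo)) as resp:
        return json.load(resp)

if __name__ == "__main__":
    # Prints the raw JSON report for this project.
    print(fetch_quality("transformers", "chaoluond/safetyllama"))
```

The URL builder is separated from the network call so it can be tested offline and reused for other repos on the same service.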