huggingface/alignment-handbook
Robust recipes to align language models with human and AI preferences
This project gives machine learning engineers and researchers clear, robust training recipes for improving language models. It pairs practical guidance with code for fine-tuning existing models so that they follow instructions more reliably and align with human or AI preferences. Use it when you are making chatbots safer, more helpful, or better adapted to specific tasks.
5,523 stars. No commits in the last 6 months. Available on PyPI.
Use this if you need concrete, tested strategies and code to refine open-source language models to meet specific performance or behavioral standards.
Not ideal if you are an end user looking for a pre-built chatbot, or if you have no experience developing machine learning models.
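To give a sense of what the recipes look like in practice: training runs are driven by the accelerate CLI plus one YAML config per model and training stage. A minimal sketch, assuming the recipe layout the repository documents (scripts/run_sft.py and per-model configs under recipes/); exact paths may vary by version:

# Supervised fine-tuning with one of the handbook's recipes (illustrative paths)
ACCELERATE_LOG_LEVEL=info accelerate launch \
    --config_file recipes/accelerate_configs/multi_gpu.yaml \
    scripts/run_sft.py recipes/zephyr-7b-beta/sft/config_full.yaml

The same pattern covers the other stages: swapping in the repo's preference-tuning script and its matching config switches the run from supervised fine-tuning to preference optimization.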
Stars: 5,523
Forks: 474
Language: Python
License: Apache-2.0
Last pushed: Sep 08, 2025
Commits (30d): 0
Dependencies: 21
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/huggingface/alignment-handbook"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
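If jq is installed, the same call can be pretty-printed; the response schema is not documented on this page, so jq is used only for formatting:

curl -s "https://pt-edge.onrender.com/api/v1/quality/transformers/huggingface/alignment-handbook" | jq .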
Related projects
agentscope-ai/Trinity-RFT
Trinity-RFT is a general-purpose, flexible and scalable framework designed for reinforcement...
OpenRLHF/OpenRLHF
An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO &...
zjunlp/EasyEdit
[ACL 2024] An Easy-to-use Knowledge Editing Framework for LLMs.
hyunwoongko/nanoRLHF
nanoRLHF: from-scratch journey into how LLMs and RLHF really work.
PKU-Alignment/align-anything
Align Anything: Training All-modality Model with Feedback