zjunlp/LookAheadTuning
[WSDM 2026] LookAhead Tuning: Safer Language Models via Partial Answer Previews
This project helps large language model (LLM) developers make their models safer and more reliable. It modifies your existing training datasets to include 'partial answer previews', producing datasets ready for a more robust fine-tuning process that yields models less prone to unsafe or incorrect outputs.
Use this if you are a machine learning engineer or researcher responsible for fine-tuning large language models and need to improve their safety and reduce undesirable outputs.
Not ideal if you are an end-user simply looking to apply an existing language model without deep involvement in its training or fine-tuning process.
Stars: 17
Forks: —
Language: Python
License: MIT
Category: —
Last pushed: Dec 14, 2025
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/zjunlp/LookAheadTuning"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
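A minimal sketch of calling this endpoint from Python, since the repository's own language is Python. The endpoint URL is the one shown above; the `X-Api-Key` header name and the JSON shape of the response are assumptions, not documented on this page — check the API docs before relying on them.

```python
"""Sketch: querying the quality API for a repository record."""
import json
import urllib.request

API_BASE = "https://pt-edge.onrender.com/api/v1/quality/llm-tools"


def quality_url(owner: str, repo: str) -> str:
    """Build the per-repository endpoint URL from the pattern above."""
    return f"{API_BASE}/{owner}/{repo}"


def fetch_quality(owner: str, repo: str, api_key: str = "") -> dict:
    """Fetch the quality record; pass api_key for the higher rate limit.

    NOTE: the 'X-Api-Key' header name is an assumption for illustration.
    """
    req = urllib.request.Request(quality_url(owner, repo))
    if api_key:
        req.add_header("X-Api-Key", api_key)
    with urllib.request.urlopen(req, timeout=10) as resp:
        return json.loads(resp.read().decode())


if __name__ == "__main__":
    # Prints the same URL as the curl example above.
    print(quality_url("zjunlp", "LookAheadTuning"))
```

Without a key this stays within the 100 requests/day anonymous limit; a free key raises it to 1,000/day.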
Higher-rated alternatives
aalok-sathe/surprisal
A unified interface for computing surprisal (log probabilities) from language models! Supports...
EvolvingLMMs-Lab/lmms-engine
A simple, unified multimodal models training engine. Lean, flexible, and built for hacking at scale.
FunnySaltyFish/Better-Ruozhiba
[Fully processed, entry by entry] A curated QA dataset of selected Ruozhiba questions, each one manually reviewed and revised
reasoning-machines/pal
PaL: Program-Aided Language Models (ICML 2023)
microsoft/monitors4codegen
Code and Data artifact for NeurIPS 2023 paper - "Monitor-Guided Decoding of Code LMs with Static...