University of Bristol
Control knobs for LLM safety exist: MASCing steers Mixture-of-Experts (MoE) behavior *without* costly retraining, improving jailbreak defense by up to 89.2% and control over adult-content generation by up to 93.0%.