University of Pavia
Forget retraining: NeWTral instantly restores safety to your LLM after adding a risky LoRA, slashing attack success rates from 70% to 13% without sacrificing expertise.