Search papers, labs, and topics across Lattice.
University of Pavia
2
0
4
Forget retraining: NeWTral instantly restores safety to your LLM after adding a risky LoRA, slashing attack success rates from 70% to 13% without sacrificing expertise.
Certifiable defenses against malware evasion are now possible without modifying the underlying ML architecture, offering a practical path to robust security.