Freezing the right neurons during alignment makes LLMs far more resistant to safety bypasses, even when those models are open-sourced.