Search papers, labs, and topics across Lattice.
University of Chinese Academy of Sciences, Beijing, China
1
0
3
Safety training doesn't just make models refuse more, it fundamentally *reorganizes* where and how those refusals happen inside the network.