Search papers, labs, and topics across Lattice.
1
0
3
Alignment interventions in LLMs can produce uneven and structured shifts across interconnected values, creating hidden risks that target-only evaluations miss.