Search papers, labs, and topics across Lattice.
B-it, Probe-based steering requires roughly
2
0
4
11
Early hidden states of LLMs can predict steering success with surprising accuracy, enabling efficient steering without exhaustive rollouts.
LLMs can exhibit surprising "strategic realism" when analyzing an ongoing geopolitical conflict, but their reasoning falters in politically ambiguous situations, revealing critical domain-specific limitations.