Activation probes can predict future behaviors in reasoning models with up to 91% accuracy, enabling effective steering without sacrificing output quality.