Frontier models are surprisingly good at taking actions at extremely low, calibrated probabilities, raising concerns about their ability to evade pre-deployment safety evaluations designed to catch malicious behavior.