Untrusted AI monitors are more vulnerable to subtle "passive self-recognition" collusion strategies than previously thought, demanding a re-evaluation of safety protocols.