MPI for Intelligent Systems, ELLIS Institute Tübingen, Tübingen AI Center
Control interventions are often detected by LLMs, with awareness levels varying significantly across models and tasks, revealing vulnerabilities in AI safety protocols.