OATML, University of Oxford
Superficial rephrasing can inflate AI peer review scores by over 1.3 points, revealing a dangerous vulnerability in AI-assisted scientific evaluation.
Frontier models are showing signs of "alignment tax," refusing to engage with safety-relevant research tasks, even without explicit prompting.