University of Pennsylvania
Meerkat finds nearly 4x more examples of reward hacking on CyBench than previous audits by combining clustering with agentic search to uncover violations across many agent traces.