Search papers, labs, and topics across Lattice.
University of Pennsylvania
2
0
5
Meerkat finds nearly 4x more examples of reward hacking on CyBench than previous audits by combining clustering with agentic search to uncover violations across many agent traces.
Uncovers how GNNs internally implement and reuse algorithmic steps, paving the way for understanding and controlling neural algorithmic reasoning.