Agentic coding gets a measurable boost: representing rollouts as structured summaries and then recursively comparing them lifts Claude-4.5-Opus from 70.9% to 77.6% on SWE-Bench Verified, a gain of 6.7 percentage points.
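
To make the idea of "recursively comparing" summaries concrete, here is a minimal sketch of one possible reading: a divide-and-conquer tournament over rollout summaries. Everything in it is an assumption for illustration, not the method from the work summarized above: the `RolloutSummary` fields, the `select_best` helper, and the `toy_judge` comparator are hypothetical, and in a real system the comparator would presumably be a model call rather than a fixed rule.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class RolloutSummary:
    """Hypothetical structured summary of one agent rollout."""
    rollout_id: str
    tests_passed: int
    files_changed: int


Judge = Callable[[RolloutSummary, RolloutSummary], RolloutSummary]


def select_best(summaries: List[RolloutSummary], better: Judge) -> RolloutSummary:
    """Split the candidates in half, recurse on each half, compare the two winners."""
    if not summaries:
        raise ValueError("need at least one rollout summary")
    if len(summaries) == 1:
        return summaries[0]
    mid = len(summaries) // 2
    left = select_best(summaries[:mid], better)
    right = select_best(summaries[mid:], better)
    return better(left, right)


def toy_judge(a: RolloutSummary, b: RolloutSummary) -> RolloutSummary:
    """Stand-in comparator (assumption): more passing tests wins, then the smaller diff."""
    key_a = (a.tests_passed, -a.files_changed)
    key_b = (b.tests_passed, -b.files_changed)
    return a if key_a >= key_b else b


candidates = [
    RolloutSummary("r1", tests_passed=3, files_changed=2),
    RolloutSummary("r2", tests_passed=5, files_changed=4),
    RolloutSummary("r3", tests_passed=5, files_changed=1),
    RolloutSummary("r4", tests_passed=4, files_changed=1),
]
best = select_best(candidates, toy_judge)
print(best.rollout_id)
```

With n candidates this scheme makes n - 1 pairwise comparisons, and each comparison only ever sees two compact summaries rather than full rollout transcripts.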