Search papers, labs, and topics across Lattice.
2
0
3
8
LLM agent progress increasingly hinges on better external cognitive infrastructure, not just stronger models.
GUI agents learn faster and generalize better with a new reward shaping technique that dynamically adapts to successful exploration trajectories, outperforming fixed reward schemes.