Search papers, labs, and topics across Lattice.
University of Science and Technology of China
2
0
4
0
LLM agents struggle to juggle multiple tasks when tool use involves realistic delays, revealing critical weaknesses in temporal reasoning and coordination.
Unlock long-context reasoning in LLMs by turning agent trajectories into gold-standard QA pairs, outperforming models 8x larger on challenging reasoning tasks.