Search papers, labs, and topics across Lattice.
2
48
4
7
GPT-5 can ace most agent benchmarks, but put it in a dynamic, real-world environment and it chokes on time-sensitive tasks, exposing a critical "sim2real" gap.
LLMs can now play at being AI researchers, but they're mostly just good at hyperparameter sweeps, not groundbreaking discoveries.