Search papers, labs, and topics across Lattice.
1
0
2
LLM benchmarks are increasingly measuring the capabilities of yesterday's models, not today's frontier, creating a widening gap that misrepresents the state of AI.