Search papers, labs, and topics across Lattice.
2
0
4
1
LLM benchmarks are increasingly measuring the capabilities of yesterday's models, not today's frontier, creating a widening gap that misrepresents the state of AI.
LLMs withhold life-saving medical advice from laypersons, even when they know the answer, revealing a dangerous side effect of current AI safety measures.