LLMs that ace math and physics still struggle with general reasoning, achieving only 63% accuracy on a new K-12 level benchmark.