Search papers, labs, and topics across Lattice.
University of Chinese Academy of Sciences ♢ Meituan
1
0
2
LLMs that ace math and physics still struggle with general reasoning, achieving only 63% accuracy on a new K-12 level benchmark.