Search papers, labs, and topics across Lattice.
University of Chinese Academy of Sciences ♢ Meituan
2
0
5
LLMs that ace math and physics still struggle with general reasoning, achieving only 63% accuracy on a new K-12 level benchmark.
AI-generated code is shipping with critical vulnerabilities that existing security tools miss, but VibeGuard catches them with high precision and recall.