Despite their visual reasoning prowess, today's MLLMs still struggle to understand handwritten math scratchwork, falling far short of human expert performance in diagnosing student errors.
LLMs in medicine may be dangerously overhyped: even the best models achieve only 39% accuracy on a contamination-free, real-world clinical benchmark, with performance tanking on newer cases.