Search papers, labs, and topics across Lattice.
2
0
5
Superficial rephrasing can inflate AI peer review scores by over 1.3 points, revealing a dangerous vulnerability in AI-assisted scientific evaluation.
Despite matching or exceeding human expert performance on generating potential diagnoses, current MLLMs struggle to synthesize multimodal clinical evidence for final diagnosis, revealing a critical gap in their clinical reasoning abilities.