Current vision-language models are surprisingly bad at interpreting scientific figures, failing to match expert-level reasoning on a new benchmark of experimental images.
GPT-5.1 can barely crack 50% accuracy when distinguishing real from AI-generated academic images, highlighting a stark gap between generative capabilities and forensic detection.
Turns out, diffusion models aren't just for generating images; they're surprisingly good at understanding them too, achieving SOTA segmentation with no architectural changes.