Search papers, labs, and topics across Lattice.
Reasoning-Lab/SPUR BUPT-Reasoning-Lab, Beijing University of Posts and Telecommunications
1
0
3
Current vision-language models are surprisingly bad at interpreting scientific figures, failing to match expert-level reasoning on a new benchmark of experimental images.