Search papers, labs, and topics across Lattice.
Southern University of Science and Technology
2
0
5
0
Key contribution not extracted.
MLLMs stumble badly when asked to reason about safety in lab settings, dropping 32% in performance compared to general knowledge, revealing a critical gap for real-world deployment.