Search papers, labs, and topics across Lattice.
2
0
5
3
Key contribution not extracted.
MLLMs stumble badly when asked to reason about safety in lab settings, dropping 32% in performance compared to general knowledge, revealing a critical gap for real-world deployment.