Search papers, labs, and topics across Lattice.
Tongji University
2
0
4
Current multimodal agents fail to consistently pass CAPTCHA tests, revealing fundamental limitations in their ability to replace humans in automated workflows.
VLMs don't fail to *recognize* harmful intent when jailbroken; instead, visual inputs *shift* their internal representations into a distinct "jailbreak state," opening a new avenue for defense.