Search papers, labs, and topics across Lattice.
4
0
5
Even with robust training techniques like EOT, a carefully crafted adversarial patch can reliably fool VIS-IR VLMs and transfer across tasks like classification, captioning, and VQA.
VLMs can be easily fooled in the real world by strategically manipulating lighting, causing them to misinterpret scenes and hallucinate nonsensical captions.
Robot control systems are shockingly vulnerable: JailWAM achieves an 84.2% success rate in jailbreaking state-of-the-art World Action Models to perform unsafe physical actions.
VLMs can be devastatingly fooled by modifying less than 2% of image pixels in a fixed, X-shaped pattern, causing them to fail spectacularly across diverse tasks like classification, captioning, and question answering.