Search papers, labs, and topics across Lattice.
2
0
3
0
LLM agents can be tricked into ignoring user instructions and misusing tools in over 90% of trials via a new "Memory Control Flow Attack" that exploits persistent memory influence.
LVLMs are more vulnerable than you think: a carefully crafted sequence of alternating text and visual prompts can bypass their safety mechanisms with significantly higher success rates.