Search papers, labs, and topics across Lattice.
CNRS & Universit茅 C么te d鈥橝zur, LJAD, Nice, France
1
0
2
LLMs harbor easily discoverable "natural backdoors"鈥攖oken sequences that trigger harmful outputs without any semantic instruction, revealing a concerning vulnerability beyond traditional prompt-based jailbreaks.