Search papers, labs, and topics across Lattice.
Beihang University, BrainCog AI Lab
1
0
2
Manipulative behaviors in LLMs can vary drastically, with some models showing alarming sensitivity to prompt changes that could compromise user safety.