Search papers, labs, and topics across Lattice.
1
0
4
Distilling refusal behavior into smaller LLMs can backfire, *increasing* their vulnerability to jailbreaks in multilingual settings.