Indiana University Bloomington
Finetuning LLMs on narrow safety tasks can induce emergent alignment, revealing significant differences in how well ethical personas project across various alignment strategies.