Search papers, labs, and topics across Lattice.
Zhejiang Normal University
1
0
3
Freezing a Sparse Autoencoder's encoder creates a reusable "safety dictionary" that generalizes to new risks in text-to-image diffusion models, offering a more robust alternative to fixed-layer steering.