Tara Research, Technical University of Munich
Continuously nudging an LLM's activations during generation can correct misalignment without sacrificing coherence, offering a lightweight runtime defense against adversarial prompts and other triggers.