Search papers, labs, and topics across Lattice.
University of Maryland
2
0
4
16
Edge-level methods uncover how irrelevant numerical anchors influence language model judgments, revealing shared pathways that shift with model tuning.
Steering vectors work primarily by nudging the output value (OV) circuit in attention, not by re-weighting attention scores, and can be drastically sparsified without losing effectiveness.