University of Maryland, College Park
Early hidden states of LLMs can predict steering success with surprising accuracy, enabling efficient steering without exhaustive rollouts.
LLMs can be tricked into choosing a specific tool over others simply by tweaking that tool's description, even when the tool is less suitable.