Search papers, labs, and topics across Lattice.
Department of Computer Science & Engineering, University of Michigan
1
1
2
6
LLMs align even better with human preferences when trained on *less* data, revealing that preference signals are surprisingly concentrated in the initial tokens of responses.