State Key Lab of Processors, Institute of Computing Technology, Chinese Academy of Sciences
LLMs align even better with human preferences when trained on *less* data, revealing that preference signals are surprisingly concentrated in the initial tokens of responses.