Search papers, labs, and topics across Lattice.
2
0
4
1
LLMs can be efficiently aligned with human preferences using only a small, carefully selected subset of training data, without sacrificing generalization ability.
Optimal Transport offers a surprisingly effective and theoretically grounded approach to preference learning, outperforming existing methods in aligning LLMs with human values and reasoning abilities.