Lattice
Lattice

Search

Search papers, labs, and topics across Lattice.

RLHF in an SFT Way: From Optimal Solution to Reward-Weighted Alignment | Lattice