Search papers, labs, and topics across Lattice.
1
0
3
Human preference judgments in PbRL are surprisingly modality-dependent: switch from text to audio and you'll see narrower decision thresholds, reduced length bias, and a shift towards user-oriented evaluation.