Human response time, a signal that is often discarded, unlocks in-context adaptation to unseen preference domains for RLHF; the resulting approach outperforms standard transformers.