Northwestern University
Human response time, a signal that is often discarded, enables in-context adaptation to unseen preference domains in RLHF, outperforming standard transformers.