Search papers, labs, and topics across Lattice.
1
0
DynaCF reveals that dynamically adjusting sample weights based on shortcut sensitivity can drastically improve the robustness of reward models against superficial cues.