Forget static rubrics: SibylSense adaptively learns rubrics at inference time, leading to more discriminative rewards and better RL performance in open-ended generation tasks.