Search papers, labs, and topics across Lattice.
Westlake University
1
0
3
11
RLVR training leaves a tell-tale sign: prompts encountered during fine-tuning produce unusually similar reasoning trajectories, detectable without access to model internals.