By explicitly modeling the latent human evaluation process, VRM offers a more robust reward model, sidestepping the pitfalls of spurious correlations that plague traditional methods.
LLMs implicitly know if their reasoning steps are correct *during* generation, according to a new step-level interpretability method.