Forget expensive, error-prone math problems: PDDL planning offers a surprisingly effective and scalable route to training better Process Reward Models for LLM reasoning.