Naturalness-based data selection, a common technique for curating LLM reasoning datasets, systematically favors longer, lower-quality reasoning chains due to a previously unnoticed "step length confounding" effect.
ARISE improves language models' math problem solving by learning and reusing successful solution strategies, outperforming existing RL methods, especially on harder, out-of-distribution problems.