Search papers, labs, and topics across Lattice.
2
0
5
2
Naturalness-based data selection, a common technique for curating LLM reasoning datasets, systematically favors longer, lower-quality reasoning chains due to a previously unnoticed "step length confounding" effect.
Training domain-specific coding LLMs with realistic environments and large-scale RL can yield substantial gains in practical software engineering tasks.