Search papers, labs, and topics across Lattice.
2
5
6
6
Forget simple scaling laws: the compute-optimal number of parallel rollouts in LLM RL plateaus, revealing distinct mechanisms for easy vs. hard problems.
LLMs gain a whopping 124% task completion boost when coupled with a world model that enables simulative reasoning, suggesting a path beyond token-by-token autoregression.