The paper investigates the relationship between depth growing and looping in LLMs, two techniques associated with improved reasoning. It shows that models built with either method exhibit convergent depth-wise signatures, suggesting a shared mechanism of iterative computation. The authors further show that the two techniques are composable: applying inference-time looping to a depth-grown model improves its accuracy on some reasoning tasks, and both approaches adapt better than the baseline when given more in-context examples or additional supervised fine-tuning data.
Looping and depth growing, two distinct methods for improving LLM reasoning, appear to rely on the same underlying iterative computation, and can be combined for further gains.
Looping, reusing a block of layers across depth, and depth growing, training shallow-to-deep models by duplicating middle layers, have both been linked to stronger reasoning, but their relationship remains unclear. We provide a mechanistic unification: looped and depth-grown models exhibit convergent depth-wise signatures, including increased reliance on late layers and recurring patterns aligned with the looped or grown block. These shared signatures support the view that their gains stem from a common form of iterative computation. Building on this connection, we show that the two techniques are adaptable and composable: applying inference-time looping to the middle blocks of a depth-grown model improves accuracy on some reasoning primitives by up to $2\times$, despite the model never being trained to loop. Both approaches also adapt better than the baseline when given more in-context examples or additional supervised fine-tuning data. Additionally, depth-grown models achieve the largest reasoning gains when using higher-quality, math-heavy cooldown mixtures, which can be further boosted by adapting a middle block to loop. Overall, our results position depth growth and looping as complementary, practical methods for inducing and scaling iterative computation to improve reasoning.
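To make the two operations concrete, the following is a minimal sketch, not the authors' implementation. It treats a model as a plain list of layer functions and shows (a) inference-time looping of a middle block and (b) growing depth by duplicating that block. The names `layer_schedule`, `forward_with_looping`, and `grow_by_duplication`, the choice of block boundaries, and the toy scalar "layers" are all assumptions made for illustration; the abstract does not specify which layers form the block or how duplicated weights are initialized.

```python
from typing import Callable, List, Sequence

# Hypothetical stand-in for a transformer layer: a function on a scalar state.
Layer = Callable[[float], float]


def layer_schedule(n_layers: int, start: int, end: int, n_loops: int = 1) -> List[int]:
    """Order of layer indices when the block [start, end) runs n_loops times."""
    if not 0 <= start < end <= n_layers:
        raise ValueError("need 0 <= start < end <= n_layers")
    if n_loops < 1:
        raise ValueError("n_loops must be at least 1")
    prefix = list(range(start))
    block = list(range(start, end))
    suffix = list(range(end, n_layers))
    return prefix + block * n_loops + suffix


def forward_with_looping(layers: Sequence[Layer], x: float,
                         start: int, end: int, n_loops: int = 1) -> float:
    """Looping: reuse the same middle block (shared weights) several times."""
    for i in layer_schedule(len(layers), start, end, n_loops):
        x = layers[i](x)
    return x


def grow_by_duplication(layers: Sequence[Layer], start: int, end: int) -> List[Layer]:
    """Depth growing (structure only): insert a copy of block [start, end).

    Assumption: in real training the copy would hold separate weights that
    continue to be trained; here the same callables are reused, so the grown
    stack initially computes exactly what a 2x-looped stack computes.
    """
    if not 0 <= start < end <= len(layers):
        raise ValueError("need 0 <= start < end <= len(layers)")
    layers = list(layers)
    return layers[:end] + layers[start:end] + layers[end:]


# Toy example: layer k adds k to the state, so the output reveals which layers ran.
toy_layers = [lambda x, k=k: x + k for k in range(6)]

baseline = forward_with_looping(toy_layers, 0.0, start=2, end=4, n_loops=1)  # 15.0
looped = forward_with_looping(toy_layers, 0.0, start=2, end=4, n_loops=2)    # 20.0

grown = grow_by_duplication(toy_layers, start=2, end=4)
grown_out = forward_with_looping(grown, 0.0, start=2, end=4, n_loops=1)      # 20.0
```

In this toy setting the grown stack and the looped stack produce the same output because the duplicated layers are identical to the originals. That is only the starting point of depth growing: once the copies are trained separately they diverge, whereas a looped block keeps sharing weights across repetitions.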