Search papers, labs, and topics across Lattice.
The paper introduces CoFiCot, a coarse-to-fine framework that adaptively allocates LLM inference resources based on query difficulty, determined by a multi-metric classifier using semantic entropy, consensus reliability, and predicted reasoning depth. CoFiCot routes simple queries to efficient aggregation and complex queries to a context-aware correction loop, formalized as a stateful sequential propagation process. By integrating Process Reward Models (PRMs) within this state-dependent trajectory, CoFiCot improves error localization and global logical coherence, outperforming stateless refinement methods.
LLMs can reason more efficiently by triaging queries and applying deep thought only when truly needed, thanks to a new coarse-to-fine inference framework.
Scaling test-time computation enhances LLM reasoning ability but faces a uniform computation paradox. Allocating identical resources leads to over-correction on simple tasks and insufficient refinement on complex ones. To address this, we propose CoFiCot, a coarse-to-fine adaptive framework that dynamically tailors inference strategies to problem difficulty. Specifically, we implement a multi-metric classifier that triages queries by synthesizing semantic entropy, consensus reliability, and predicted reasoning depth . This enables a differentiated refinement stage that applies efficient aggregation for simple queries while routing complex ones to a context-aware correction loop . We formalize correction as a stateful sequential propagation process , where each repair is strictly conditioned on the verified history of prior rectifications. By integrating Process Reward Models (PRMs) within this state-dependent trajectory, CoFiCot effectively bridges the gap between granular error localization and global logical coherence, preventing the context fragmentation typical of stateless refinement methods.