This paper introduces APEX (Adaptive Policy Execution), a plug-and-play framework that bridges the execution gap in imitation learning by adapting a policy's high-level action outputs to low-level control dynamics, without modifying the existing policy or controller. APEX reconstructs a dynamically feasible reference from the policy outputs and adapts it at test time using low-level state feedback. In the reported experiments, it reduces controller-induced tracking error by 41.2% on demonstration replay and improves manipulation success by 4.8 to 25.8 percentage points across four visuomotor and VLA policy classes.
APEX reduces controller-induced tracking error by 41.2% and improves manipulation success by 4.8 to 25.8 percentage points, without altering the existing policy or controller.
Modern imitation learning methods, including visuomotor and Vision-Language-Action (VLA) policies, typically output high-level action references that are executed by low-level controllers. However, the absence of higher-order reference signals, together with the policy's lack of awareness of the underlying low-level control dynamics during training, inevitably induces an execution gap. As a result, realized actions deviate systematically from policy-commanded ones, with a critical impact on precision-sensitive manipulation. Prior work modifies either the policy architecture or the low-level controller, both of which require intrusive changes to the pretrained policy or packaged controller. This raises a natural question: when the policy and controller are both treated as inaccessible black boxes, can we bridge the execution gap? We propose Adaptive Policy Execution (APEX), a plug-and-play framework inserted between the policy and the controller that reconstructs a dynamically feasible reference from policy outputs and adapts at test time according to low-level state feedback, with a provable convergence guarantee. Extensive empirical studies show that APEX reduces controller-induced tracking error by 41.2% on demonstration replay and improves manipulation success by 4.8 to 25.8 percentage points across four visuomotor and VLA policy classes.
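To make the execution gap concrete, the toy simulation below may help. It is not the APEX algorithm, which the abstract does not specify; it is a minimal sketch under assumed conditions. The controller (a first-order lag), the policy output (a sinusoid), and the wrapper (a plain integral correction on the reference) are all hypothetical stand-ins. The sketch only illustrates the general idea of a layer that sits between a black-box policy and a black-box controller and adjusts the reference using measured low-level state.

```python
import math

def controller_step(x, r, alpha=0.2):
    """Hypothetical black-box controller: first-order lag toward reference r."""
    return x + alpha * (r - x)

def policy_output(k, dt=0.05):
    """Hypothetical policy output: a sinusoidal position command."""
    return math.sin(k * dt)

def rollout(steps=2000, gain=0.0):
    """Simulate the loop and return the RMS gap between command and state.

    gain = 0 sends policy outputs straight to the controller (baseline).
    gain > 0 adds a wrapper that shifts the reference by a correction
    accumulated from the observed tracking error (integral feedback).
    Neither the policy nor the controller is modified in either case.
    """
    x, correction, sq_err = 0.0, 0.0, 0.0
    for k in range(steps):
        p = policy_output(k)
        err = p - x                 # measured low-level state feedback
        sq_err += err * err
        r = p + correction          # adjusted reference sent to the controller
        x = controller_step(x, r)
        correction += gain * err    # update uses feedback only, no model
    return math.sqrt(sq_err / steps)

baseline_rms = rollout(gain=0.0)
adapted_rms = rollout(gain=0.5)
print(f"baseline RMS tracking error: {baseline_rms:.4f}")
print(f"adapted  RMS tracking error: {adapted_rms:.4f}")
```

In this toy setting the lagging controller trails the commanded sinusoid, and the correction term shrinks that gap without touching either black box. The numbers it prints come from the assumed dynamics only and say nothing about the results reported for APEX.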