NTUJun 15, 2026arXiv:2606.16504

APEX: Adaptive Policy Execution for Precise Manipulation

Mengfei Zhao, Chenxi Jiang, Tuo An, Jindou Jia, Jianfei Yang

AI Summary

This paper introduces APEX, a novel framework designed to bridge the execution gap in imitation learning by adapting high-level action outputs to low-level control dynamics without modifying the existing policy or controller. By reconstructing dynamically feasible references and incorporating real-time low-level state feedback, APEX achieves a significant reduction in tracking error and enhances manipulation success rates across various policy classes. The empirical results demonstrate a 41.2% decrease in controller-induced tracking error and an improvement in manipulation success by up to 25.8 percentage points, underscoring the framework's effectiveness in precise manipulation tasks.

Key Contribution

APEX reduces tracking error by over 41% and boosts manipulation success rates significantly, all without altering existing policies or controllers.

Abstract

Modern imitation learning methods, including visuomotor and Vision-Language-Action (VLA) policies, typically output high-level action references that are executed by low-level controllers. However, the absence of higher-order reference signals, together with the policy's lack of awareness of the underlying low-level control dynamics during training, inevitably induces an execution gap. As a result, realized actions deviate systematically from policy-commanded ones, with a critical impact on precision-sensitive manipulation. Prior work either modifies the policy architecture or the low-level controller, both requiring intrusive changes to the pretrained policy or packaged controller. This raises a natural question: when the policy and controller are both treated as inaccessible black boxes, can we bridge the execution gap? We propose Adaptive Policy Execution (APEX), a plug-and-play framework inserted between the policy and the controller that reconstructs a dynamically feasible reference from policy outputs and adapts at test-time according to low-level state feedback, with a provable convergence guarantee. Extensive empirical studies show that APEX reduces controller-induced tracking error by 41.2% on demonstration replay and improves manipulation success by 4.8--25.8 percentage points across four visuomotor and VLA policy classes.

Multimodal Models Robotics & Embodied AI

Citation Metrics

Citations0

Influential citations0

References0

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

APEX: Adaptive Policy Execution for Precise Manipulation

Related Papers