CUHKShanghai AI LabShanghaiTechSJTUMay 31, 2026arXiv:2606.01098

Implicit Drifting Policy: One-Step Action Generation via Conditional Expert Geometry

Zemin Yang, Yaoyu He, Yiming Zhong, Yuhao Zhang, Xinge Zhu, Yao Mu, Qingqiu Huang, Yuexin Ma

AI Summary

This paper introduces the Implicit Drifting Policy (IDP), a novel one-step imitation learning framework designed to enhance action generation for high-frequency robot control by leveraging conditional expert geometry. By extracting local variations from expert actions and comparing them to a global reference, IDP effectively enforces manifold constraints without the need for explicit vector field estimation, addressing the limitations of traditional iterative sampling methods. Extensive evaluations demonstrate that IDP not only maintains adherence to valid action manifolds but also outperforms explicit drifting methods while achieving competitive results against strong one-step baselines across various tasks.

Key Contribution

IDP achieves high-frequency robot control by enforcing action manifold constraints without the computational burden of iterative sampling.

Abstract

Generative action policies based on diffusion or flow matching excel in behavior cloning, yet their iterative sampling is prohibitive for high-frequency robot control. While recent one-step formulations alleviate this latency, they inevitably discard the intermediate trajectory evolution that provides crucial action correction. Directly recovering this mechanism by explicitly estimating a training-time drifting field is mathematically ill-posed due to extreme conditional demonstration sparsity. We introduce Implicit Drifting Policy (IDP), a one-step imitation learning framework that brings the training-time correction of Drifting into policy learning without explicit vector field estimation. IDP extracts a conditional expert geometry from the local variation of observation-similar expert actions, and compares it against a global reference geometry to isolate condition-specific constraints. This local geometric structure adaptively weights a scalar potential objective. Combined with an expert-proximal terminal evaluation, IDP directly enforces manifold constraints on the one-step generator during training. Extensive evaluations across 2D, 3D, and real-world manipulation tasks show IDP effectively maintains adherence to valid action manifolds, improving upon explicit drifting methods and achieving competitive performance with strong one-step baselines.

Robotics & Embodied AI

Citation Metrics

Citations0

Influential citations0

References0

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

Implicit Drifting Policy: One-Step Action Generation via Conditional Expert Geometry

Related Papers