Search papers, labs, and topics across Lattice.
HiPolicy addresses the trade-off between long-horizon planning and fine-grained control in robotic imitation learning by introducing a hierarchical multi-frequency action chunking framework. The method predicts action sequences at different frequencies, extracting and fusing hierarchical features from historical observations aligned to each frequency. Experiments demonstrate that HiPolicy improves performance and execution efficiency when integrated into existing 2D and 3D generative policies on simulated and real-world tasks.
Achieve both long-horizon planning and fine-grained control in robotic imitation learning by predicting action sequences at different frequencies.
Robotic imitation learning faces a fundamental trade-off between modeling long-horizon dependencies and enabling fine-grained closed-loop control. Existing fixed-frequency action chunking approaches struggle to achieve both. Building on this insight, we propose HiPolicy, a hierarchical multi-frequency action chunking framework that jointly predicts action sequences at different frequencies to capture both coarse high-level plans and precise reactive motions. We extract and fuse hierarchical features from history observations aligned to each frequency for multi-frequency chunk generation, and introduce an entropy-guided execution mechanism that adaptively balances long-horizon planning with fine-grained control based on action uncertainty. Experiments on diverse simulated benchmarks and real-world manipulation tasks show that HiPolicy can be seamlessly integrated into existing 2D and 3D generative policies, delivering consistent improvements in performance while significantly enhancing execution efficiency.